Dkt. No. 2368/24 

CRYSTAL STRUCTURES OF P- SELECTIN, 
P- AND E-SELECTIN COMPLEXES. AND USES THEREOF 

5 This application claims the benefit of U.S. Provisional Application 

No. 60/205,875 filed May 19, 2000. 

Field of the Invention 
The present invention relates to the crystal and three dimensional 
10 structures of the lectin and EGF-like (LE) domains of P-selectin, the crystal and 
three dimensional structures of P-selectin LE and E-selectin LE each complexed 
with SLe^, as v\^ell as the crystal and three dimensional structure of P-selectin LE 
complexed with a functional PSGL-1 peptide modified by both tyrosine sulfation 
and SLe^. These structures are critical for the design and selection of agents 
15 that interfere with the cellular rolling of leukocytes in the inflammation process. 

Background of the Invention 
The selectins are a family of cell-surface glycoproteins responsible 
for early adhesion events in the recruitment of leukocytes into sites of 

20 inflammation and their emigration into Ijrmphatic tissues (reviewed in (Kansas, 
1996) and (Vestweber and Blanks, 1999)). As part of a multistep process 
(Springer, 1994), selectins promote the initial attachment (tethering) and 
subsequent rolling of leukocytes over vessel walls where they become activated 
as a consequence of exposure to locally produced chemokines. Firm adhesion of 

25 the leukocytes mediated by integrins precedes their extravasation into the 
underlying tissue. P-selectin (CD62P) and E-selectin (CD62E) are induced on 
the surface of vascular endothelium in response to inflammatory stimuli. P- 
selectin, also expressed by activated platelets, is translocated within minutes 
from intracellular stores to the cell-surface following induction by inflammatory 

30 mediators. E-selectin is transcriptionally regulated and appears within a few 
hours of activation of the vascular endothelium. L-selectin (CD62L), a third 
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member of the selectin family is expressed constitutively on leukocytes. In 
addition to its role in inflammation, L-selectin mediates the attachment of 
lymphocytes to specialized high endothelial venules in the course of their 
migration from the blood to Ijnnphoid tissues. 
5 The selectins share a number of structural and functional 

properties. They consist of a highly homologous N-terminal calcium-dependent 
(C-type, (Drickamer, 1988)) lectin domain, an epidermal growth factor (EGF)- 
like domain, variable numbers of complement regulatory-like units, a 
transmembrane domain, and an intracellular region. It is generally accepted that 

10 selectin binding is mediated predominandy through weak protein-carbohydrate 
interactions between the lectin domain and glycan ligands on apposing cells. A 
number of diverse glycan structures have been described as able to support 
and/or to inhibit selectin binding. However, an epitope displayed by the sialyl 
Lewis^ (SLe^, NeuNAc2,3Gall,4 [Fucl,3] GlcNAc) tetrasaccharide and related 

15 structures appears to be a physiologically relevant recognition component 
common to all three selectins (Foxall et al., 1992). Additional factors for 
recognition have been suggested by the isolation of specific glycoprotein 
counterreceptors of apparent high affinity (or avidity) yet of diverse structure. 
These include GlyCAM-1 (Lasky et al., 1992), MAdCAM-1 (Berg et al., 1993), 

20 CD34 (Baumheuter et al., 1993), ESL-1 (Levinovitz et al., 1993), and PSGL-1 
(Moore et al., 1992); (Sako et al., 1993), However, there is only limited 
evidence that any of these heavily glycosylated proteins are in fact essential 
raising the possibility that it is the cell- or tissue-specific glycosylation 
capabilities (i.e., ability to produce SLe^-like glycans), and not the expression of 

25 specific glycoproteins, which ultimately confers selectin reactivity. The clear 
exception is PSGL-1, a mucin-like homodimeric glycoprotein expressed by 
virtually all subsets of leukocytes (reviewed in (Yang et al., 1999) and (McEver 
and Cummings, 1997)). While it was anticipated that P-selectin recognition 
would be dependent upon SLe^-like modifications of glycans within a mucin-like 

30 region of PSGL-1, an essential binding epitope was localized to the anionic, N- 
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terminal portion of the polypeptide backbone well outside of the mucin domain 
(Pouyani and Seed, 1995; Sako et al., 1995; Wilkins et al., 1995). Numerous 
studies have demonstrated that P-selectin mediated binding and in vivo 
inflammatory responses can be greatiy diminished by targeting this polypeptide 
5 determinant within PSGL-1. (McEver and Cummings, 1997; Yang et al., 1999). 
Moreover, recent studies of mice rendered genetically deficient in PSGL-1 have 
been described that show significant defects in P-selectin mediated leukocyte 
rolling and inflammatory recruitment (Yang et al., 1999). Therefore, PSGL-1 
represents the single example of a selectin counterreceptor for which both 

10 polypeptide and SLe^-modified glycan components are required for 
physiologically relevant binding. 

In light of their unique function as mediators of cell attachment 
and rolling under the influence of shear stresses encountered within the 
vasculature, considerable efforts have been directed towards characterization of 

15 the underlying biophysical and molecular bases of selectin interactions. The 
selectins associate and dissociate with their ligands with rapid binding kinetics 
(Alon et al., 1997; Alon et al., 1995) and it is this property which appears 
responsible in part for their ability to mediate transient tethers and the cellular 
rolling phenomenon. Other factors including the mechanical (Alon et al., 1995; 

20 Puri et al., 1998) and unique structural properties (Chen et al, 1997) of selectin 
interactions also appear to be involved, but these remain incompletely 
characterized owing to a lack of high-resolution molecular structures of the 
selectins complexed with physiological ligands. Significant efforts have been 
made to overcome this limitation but to date these remain incomplete. An X-ray 

25 crystal structure of an E-selectin construct containing the lectin and EGF 
domains (lec/EGF) has been previously described (Graves et al., 1994) and, 
combined with site-directed mutagenesis studies (Kansas, 1996), suggested a 
putative SLe^ binding site localized to the lectin domain. Models for SLe^ 
binding to the E-selectin crystal structure have been proposed (Graves et al., 

30 1994; Kogan et al., 1995; Poppe et al., 1997) based upon molecular docking of 
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the free or bound solution structures of SLe^ ((Poppe et al., 1997) and 
references therein) using the X-ray crystal structure of the homologous rat 
serum mannose-binding protein (MBP-A) bound to oligomannose (Weis et al., 
1992) as a guide. However, structures of E-selectin or other selectins, 
5 complexed with SLe^-like carbohydrate ligands have not been determined 
confirming these hypotheses. In fact, prior to the present invention, such a 
crystal could not be obtained due to the blocking effect of high concentrations of 
calcium used to form the selectin crystals. 

To date, crystal structures of MBP-A mutated to include E-selectin 

10 residues (the K3 mutant) co-complexed with SLe^ and related glycans (Ng and 
Weis, 1997) represent the only direct information of how selectins might bind 
their ligands. Collectively, these models and experimentally determined 
structures support an SLe^ binding motif in which two hydroxyl groups of the 
Fuc moiety ligate the lectin domain-bound calcium and additional binding 

15 interactions are perhaps mediated by the hydroxyl groups of Gal and the 
carboxylate moiety of NeuNAc. However, the models and MBP-A K3 
mutant/SLe^ crystal structure differ in the orientation of SLe^ within the binding 
site and in the identity of molecular contacts. 

An understanding of the structural basis for the high-affinity P- 

20 selectin/PSGL-1 interaction, is also incomplete and complicated by the 

observation that recognition is based upon both carbohydrate and polypeptide 
components. Mutagenesis studies have focused on the N-terminus of PSGL-1 
and have demonstrated that P-selectin recognizes simultaneously one or more 
tyrosine sulfated residues within an anionic region of the PSGL-1 polypeptide 

25 and an SLe^-containing O-glycan potentially localized to this region (Pouyani 
and Seed, 1995; Ramachandran et al., 1999; Sako et al., 1995; Wilkins et al., 
1995). While the putative SLe^ binding site is anticipated to be similar between 
selectins and perhaps involved in the binding of the SLe^ component of PSGL-1 
by P-selectin, the identity and structural basis of the P-selectin domain 

30 mediating the interaction with the PSGL-1 polypeptide are unknown. L-selectin 
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has also been shown to recognize PSGL-1 (Kansas, 1996; Vestweber and Blanks, 
1999) in the context of neutrophil-neutrophil interactions perhaps important for 
the ampHfication of leukocyte recruitment to sites of inflammation. This 
interaction is also affected by mutations within the N-terminus of the PSGL-1 
5 polypeptide (Ramachandran et al., 1999) suggesting that P- and L-selectin have 
common binding requirements. In contrast, E-selectin binding to PSGL-1 or to 
other SLe^-presenting counterreceptors has not been demonstrated to require 
polypeptide components. 

10 Summary of the Invention 

The present invention provides a crystal of lectin and EGF-like 
(LE) domains of P-selectin ("P-selectin LE"), as well as the three dimensional 
structure of P-selectin LE as derived by x-ray diffraction data of the P-selectin LE 
crystal. Specifically, the three dimensional structure of P-selectin LE is defined 

15 by the structural coordinates shown in Figure 2, ± a root mean square deviation 
from the backbone atoms of the amino acids of not more than 1.5 A. The 
structural coordinates of the three dimensional structure of P-selectin LE are 
useful for a number of applications, including, but not limited to, the 
visuaUzation, identification and characterization of various active sites of P- 

20 selectin LE, including the site in which SLe^ binds. The active site structures 
may then be used to design agents which interact with P-selectin LE, as well as 
P-selectin LE complexed with SLe^ PSGL-1, or related molecules. 

The present invention also provides a crystal of P-selectin LE 
complexed with SLe^ as well as the three dimensional structures of P-selectin LE 

25 and SLe^ as derived by x-ray diffraction data of the P-selectin LE: SLe^ crystal. 
Specifically, the three dimensional structures of P-selectin LE and SLe^ are 
defined by the structural coordinates shown in Figure 3, ± a root mean square 
deviation from the backbone atoms of the amino acids of not more than 1.5 A. 
The structural coordinates of P-selectin LE and SLe^ are useful for a number of 

30 applications, including, but not limited to, the visualization, identification and 
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characterization of various active sites of P-seiectin, SLe^ and the P-selectin LE: 
SLe^ complex, including the SLe^ binding site. The active site structures may 
then be used to design various agents which interact with P-selectin LE, SLe^, as 
well as P-selectin LE complexed with SLe^ PSGL-1, or related molecules. 
5 The present invention still further provides a crystal of lectin and 

EGF (LE) domains of E-selectin ("E-selectin LE") complexed with SLe^, as well 
as the three dimensional structures of E-selectin LE and SLe^ as derived by x-ray 
diffraction data of the E-selectin LE: SLe^ crystal. Specifically, the three 
dimensional structures of E-selectin LE and SLe^ are defined by the structural 
10 coordinates shown in Figure 4, ± a root mean square deviation from the 

0 

backbone atoms of the amino acids of not more than 1.5 A. The structural 
coordinates of the three dimensional structures of E-selectin LE and SLe^ are 
useful for a number of applications, including, but not limited to, the 
visualization, identification and characterization of various active sites of E- 

15 selectin LE, SLe^ and the E-selectin LE: SLe^ complex, including the SLe^ binding 
site. The active site structures may then be used to design various agents which 
interact with E-selectin LE, SLe^, as well as E-selectin LE complexed with SLe^, 
PSGL-1 or related molecules. 

Still further, the present invention provides a crystal structure of P- 

20 selectin LE complexed with a functional PSGL-1 peptide modified by both 
tyrosine sulfation and SLe^ as well as the three dimensional structures of P- 
selectin LE and the PSGL-1 peptide as derived by x-ray diffraction data of the P- 
selectin LE: PSGL-1 peptide crystal. Specifically, the three dimensional 
structures of P-selectin LE and the PSGL-1 peptide are defined by the structural 

25 coordinates shown in Figure 5, ± a root mean square deviation from the 

o 

backbone atoms of the amino acids of not more than 1.5 A. The structural 
coordinates of P-selectin LE and the PSGL-1 peptide are useful for a number of 
applications, including, but not limited to, the visualization, identification and 
characterization of various active sites of P-selectin LE, the PSGL-1 peptide and 
30 the P-selectin LE: PSGL-1 complex, including the SLe^ and PSGL-1 binding sites. 
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The active site structures may then be used to design various agents which 
interact with P-selectin LE, PSGL-1, as well as P-selectin LE complexed with 
SLe^, PSGL-1, or related molecules. 

The present invention is also directed to an active site of an SLe"" 
5 binding protein or peptide, and preferably the SLe^ binding site of P-selectin LE, 
comprising the relative structural coordinates of amino acid residues TYR48, 
GLU80, ASN82, GLU92, TYR94, PR098, SER99, ASN105, ASP106, GLU107 and 
bound calcium according to Figure 3, ± a root mean square deviation from the 

o 

backbone atoms of said amino acids of not more than 1.5 A. Alternatively, the 
10 active site may include, in addition to the structural coordinates define above, 
the relative structural coordinates of amino acid residues TYR44, SER46, 
SER47, ALA77, ASP78, ASN79, PR081, ASN83, ARG85, GLU88, CYS90, ILE93, 
LYS96, SER97, ALAIOO, TRP104, HIS108, LYSlll and LYS113 according to 
Figure 3, ± a root mean square deviation from the backbone atoms of the amino 

o 

15 acids of not more than 1.5 A, The SLe^ active site may correspond to the 
configuration of the P-selectin LE in its state of association with an agent, 
preferably, SLe'', or in its unbound state. 

The present invention is further directed to an active site of an 
SLe"" binding protein or peptide, and preferably the SLe'' binding site of E- 

20 selectin LE, comprising the relative structural coordinates of amino acid residues 
TYR48, GLU80, ASN82, ASN83, GLU92, TYR94, ARG97, GLU98, ASN105, 
ASP106, GLU107 and bound calcium according to Figure 4, ± a root mean 
square deviation from the backbone atoms of said amino acids of not more than 
1.5 A. Alternatively, the active site may include, in addition to the structural 

25 coordinates define above, the relative structural coordinates of amino acid 
residues TYR44, SER45, PR046, SER47, ALA77, PR078, GLY79, PR081, 
GLU88, CYS90, LYS99, ASPlOO, TRP104, ARG108, LYSlll and LYS113 
according to Figure 4, ± a root mean square deviation from the backbone atoms 
of the amino acids of not more than 1.5 A. The SLe"" active site may correspond 
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to the configuration of the E-selectin LE in its state of association with an agent, 
preferably, SLe^ or in its unbound state. 

Still further, the present invention provides an active site of a 
PSGL-1 binding protein or peptide, and preferably the PSGL-1 binding site of P- 
5 selectin LE, comprising the relative structural coordinates of amino acid residues 
ALA9, TYR45, SER46, SER47, TYR48, GLU80, ASN82, LYS84, ARG85, GLU88, 
GLU92, TYR94, PR098, SER99, ASN105, ASP106, GLU107, HIS108, LEUllO, 
LYSlll, LYS112, LYS113, HIS114 and bound strontium according to Figure 5, 
± a root mean square deviation from the backbone atoms of said amino acids of 

o 

10 not more than 1.5 A. Alternatively, the active site may include, in addition to 
the structural coordinates define above, the relative structural coordinates of 
amino acid residues SER6, THR7, LYS8, TYRIO, SERll, TYR44, TYR49, TRP50, 
ALA77, ASP78, ASN79, PR081, ASN83, ASN86, ASN87,CYS90, ILE93, ILE95, 
LYS96, SER97, ALAIOO, TRP104 and CYS109 according to Figure 5, ± a root 

15 mean square deviation from the backbone atoms of the amino acids of not more 

o 

than 1.5 A. The PSGL-1 binding site may correspond to the configuration of the 
P-selectin LE in its state of association with an agent, preferably, PSGL-1 or a 
PSGL-1 peptide, or in its unbound state. 

In addition, the present invention provides a method for 

20 identifying an agent that interacts with P-selectin LE, comprising the steps of: 
(a) generating a three dimensional model of P-selectin LE using the relative 
structural coordinates according to Figures 2, 3 or 5, ± a root mean square 
deviation from the backbone atoms of said amino acids of not more than 1.5 A; 
and (b) employing said three-dimensional structure to design or select an 

25 agent.. 

In addition, the present invention provides a method for 
identifying an activator or inhibitor of a molecule or molecular complex 
comprising an SLe"" binding site, comprising the steps of: (a) generating a three 
dimensional model of said molecule or molecular complex comprising an SLe^ 
30 binding site using (i) the relative structural coordinates according to Figure 3 of 
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residues TYR48, GLU80, ASN82, GLU92, TYR94, PR098, SER99, ASN105, 
ASP106, GLU107 and bound calcium, ± a root mean square deviation from the 

o 

backbone atoms of said amino acids of not more than 1.5 A, or (ii) the relative 
structural coordinates according to Figure 4 of amino acid residues TYR48, 
5 GLU80, ASN82, GLU92, TYR94, ARG97, GLU98, ASN105, ASP106, GLU107 and 
bound calcium, ± a root mean square deviation from the backbone atoms of 

o 

said amino acids of not more than 1.5 A; and (b) selecting or designing a 
candidate activator or inhibitor by performing computer fitting analysis with the 
three dimensional model generated in step (a). In another embodiment, the 

10 relative structural coordinates according to Figure 3 further comprises amino 
acid residues TYR44, SER46, SER47, ALA77, ASP78, ASN79, PR081, ASN83, 
ARG85, GLU88, CYS90, ILE93, LYS96, SER97, ALAIOO, TRP104, HIS108, 
LYSlll and LYS113, ± a root mean square deviation from the backbone atoms 
of said amino acids of not more than 1. sA. In yet another embodiment, the 

15 relative structural coordinates according to Figure 4 further comprises the 
amino acid residues TYR44, SER45, PR046, SER47, ALA77, PR078, GLY79, 
PR081, GLU88, CYS90, LYS99, ASPlOO, TRP104, ARG108, LYSlll and 
LYS113, ± a root mean square deviation from the backbone atoms of said 
amino acids of not more than 1.5 A. 

20 The present invention still further provides a method for 

identifying an activator or inhibitor of a molecule or molecular complex 
comprising a PSGL-1 binding site, comprising the steps of: (a) generating a 
three dimensional model of said molecule or molecular complex comprising a 
PSGL-1 binding site using the relative structural coordinates according to Figure 

25 5 of amino acid residues ALA9, TYR45, SER46, SER47, TYR48, GLU80, ASN82, 
LYS84, ARG85, GLU88, GLU92, TYR94, PR098, SER99, ASN105, ASP106, 
GLU107, HIS1G8, LEUllO, LYSlll, LYS112, LYS113, HIS114 and bound 
strontium, ± a root mean square deviation from the backbone atoms of said 

o 

amino acids of not more than 1.5 A; and (b) selecting or designing a candidate 
30 activator or inhibitor by performing computer fitting analysis with the three 
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dimensional model generated in step (a). In another embodiment, the relative 
structural coordinates according to Figure 5 further comprises amino acid 
residues SER6, THR7, LYS8, TYRIO, SERll, TyR44, TYR49, TRP50, ALA77, 
ASP78, ASN79, PR081, ASN83, ASN86, ASN87,CYS90, ILE93, ILE95, LYS96, 
5 SER97, ALAIOO, TRP104 and CYS109, ± a root mean square deviation from the 

o 

backbone atoms of said amino acids of not more than 1.5 A. 

In addition, the present invention provides a method for 
identifying an agent that interacts with SLe^ comprising the steps of: (a) 
generating a three dimensional model of SLe^ using the relative structural 
10 coordinates according to Figures 3 or 4, ± a root mean square deviation from 

o 

the backbone atoms of said amino acids of not more than 1.5 A; and (b) 
employing said three-dimensional structure to design or select an agent that 
interacts with SLe"". 

Still further, the present invention provides a method for 
15 identifying an agent that interacts with PSGL-1, comprising the steps of: (a) 
generating a three dimensional model of a PSGL-1 peptide using the relative 
structural coordinates according to Figure 5, ± a root mean square deviation 

o 

from the backbone atoms of said amino acids of not more than 1.5 A; and (b) 
employing said three-dimensional structure to design or select an agent that 

20 interacts with PSGH . 

The present invention also provides agents, activators or inhibitors 
identified using the foregoing methods. Small molecules or other agents which 
inhibit or otherwise interfere with the selectin-mediated cellular rolling of 
leukocytes over vascular tissue may be useful in the treatment of diseases 

25 involving abnormal inflammatory responses such as asthma and psoriasis. 

Finally, the present invention provides a method for obtaining a 
crystallized complex of an E-selectin type molecule and a compound that 
coordinates calcium. The method comprises the steps of: (a) contacting a 
crystallized E-seiectin type molecule with a compound that coordinates calcium 

30 in the presence of calcium ions and PEG to form a crystallized complex of the E- 
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selectin type molecule and the compound that coordinates calcium; and (b) 
contacting the crystallized complex in the presence of a reduced concentration 
of calcium ions, and sufficient concentrations of PEG and an ionic salt to obtain 
a final crystallized complex, that upon cooling, is suitable for elucidating the 
5 three dimensional structures of the E-selectin type molecule and the compound 
that coordinates calcium by x-ray diffraction of the final crystaUized complex. 

Additional objects of the present invention will be apparent from 
the description which follows. 

10 Brief Description of the Figures 

Figures lA, IB and IC depict the resolution and BIAcore affinity 
analysis of the PSGL-1 19ek peptides. 

Fig. lA: Profile of the PSGL-1 19ek peptides resolved by anion- 
exchange chromatography. See text for definition of peak labels. Inset. 

15 Structure of the major PSGL-1 19ek peptide SGP-3. <Q denotes cyclization of 
the N-terminal Gin residue to pyroglutamate and SO3 represents sulfation of Tjnr 
residues. The over line in the numbered peptide sequence indicates residues of 
non-PSGL-1 origin that are associated with the enterokinase hnker region. 

Figs. lB-1 and lB-2: Representative BIAcore sensorgrams of 

20 solution-phase P-LE binding to immobilized PSGL-1 constructs. P-LE (at 800 nM 
concentration) was injected over sPSGL (Fig. lB-1) and SGP-3 (Fig. lB-2). 
Binding signals of P-LE injected other less modified forms of SGP-3 (not shown) 
were identical to these results with regards to binding kinetics. P-LE binding to 
sPSGL and SGP-3 was inhibited by co-injection with solution-phase SGP-3 

25 (dotted line) but not by a sj^thetic peptide containing no tyrosine sulfation or 
glycosylation (dashed line), both at 20 |J,M concentration. All curves reflect 
specific binding produced by subtracting non-specific binding from total 
binding. 

Figs. lC-1 - lC-6: Binding affinity determinations of solution- 
30 phase P-LE reacted with immobiUzed sPSGL and purified 19ek peptides by 
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BIAcore analysis. First row, left-to-right: sPSGL (Fig. lC-1), SGP-3 (Fig. lC-2) 
and SGP-2 (Fig. lC-3). Second row, left-to-right: SGP-1 (Fig. lC-4), GP-1 (Fig. 
lC-5) and SP-1 (Fig. lC-6). P-LE at the indicated concentrations was reacted 
with immobilized ligands (to determine the total binding signal) and against 
5 control cells containing no ligand (to determine non-specific binding). The 
equilibrium responses (Req) at each concentration of P-LE are shown. At each 
P-LE concentration tested, specific binding signals (squares) were determined by 
subtracting non-specific responses (triangles) from total binding signals 
(diamonds). K^s and standard deviations (shown) were determined by line- 

10 fitting of specific binding curves using BIAevaluation software (BL^Vcore) and are 
the products of three-to-ten separate experiments. Kj^ determinations using line- 
fitting agreed well with values determined by linear regression analysis of 
Scatchard plots (not shown). 

Figure 2 provides the atomic structural coordinates for P-selectin 

15 LE as derived by X-ray diffiraction of a P-selectin LE crystal. "Atom type" refers 
to the atom whose coordinates are being measured. "Residue" refers to the type 
or residue of which each measure atom is a part - i.e., amino acid, cofactor, 
ligand or solvent. The "x, y and z" coordinates indicate the Cartesian 
coordinates of each measured atom's location in the unit cell (A). "Occ" 

20 indicates the occupancy factor. "B" indicates the "B-value", which is a measure 
of how mobile the atom is in the atomic structure (A^). "MOL" indicates the 
segment identification used to identify each molecule in the crystal. Under 
"MOL", "MOLA", "MOLB", "MOLC" and "MOLD" refers to each molecule of P- 
selectin LE, "SOLV" refers to water molecules, "MPDS" refers to MPD molecules 

25 and "GALS" refers to calcium ions. Due to disordered structures, Lysl7 (MOLA), 
Lysl7 (MOLC) and Asn57 (MOLC) of P-selectin are represented as alanines. 

Figure 3 provides the atomic structural coordinates for P-selectin 
LE and SLe"" as derived by X-ray diffraction of a P-selectin LE: SLe"" crystal. 
Figure headings are as noted for Figure 2, except that under "MOL", "A", "B", 

30 "C" and "D" refers to each molecule of P-selectin LE, and SLe^ MDP molecules 
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and calcium ions are not labeled under "MOL". However, SLe"", MDP molecules 
and calcium ions are identified under "Residue". Due to disordered structures, 
Lysl7 (MOIA), Lysl7 (MOLC) and Asn57 (MOLC) of P-selectin are represented 
as alanines. 

5 Figure 4 provides the atomic structural coordinates for E-selectin 

LE and SLe"" as derived by X-ray diffraction of a P-selectin LE: SLe"" crystal. 
Figure headings are as noted for Figure 2, except that under "MOL", "SOLV" 
refers to water molecules, and E-selectin LE, SLe'' and calcium are not labeled 
under "MOL". However, E-selectin LE, SLe"" and calcium are identified under 
10 "Residue". 

Figure 5 provides the atomic structural coordinates for P-selectin 
LE and PSGL-1 peptide as derived by X-ray diffraction of a P-selectin:PSGL-l 
peptide crystal. Figure headings are as noted for Figure 2, except that Figure 5 
does not include a "MOL" heading. However, each molecule of P-selectin LE, 

15 PSGL-1 peptide, water and MPD are identified under "Residue". Due to 

disordered structures, Asn57 (MOLA), Lys58 (MOLA), Asn71 (MOLA), Arg22 
(MOLB), Asn57 (MOLB), Lys58 (MOLB), Glu72 (MOLB), Metl25 (MOLB) and 
Argl57 (MOLB) of P-selectin are represented as alanines. 

Figures 6A, 6B and 6C provide the amino acid sequences of P- 

20 selectin (Fig. 6A), E-selectin (Fig. 6B) and PSGL-1 (Fig. 6C). The segments of 
the sequences used to make the constructs in the crystals are underlined. 

Detailed Description of the Invention 
As used herein, the following terms and phrases shall have the 
25 meanings set forth below; 

Unless otherwise noted, "selectins" are a family of cell-surface 
glycoproteins responsible for early adhesion events in the recruitment of 
leukocytes into sites of inflammation and their emigration into lymphatic 
tissues, and include P-selectin, E-selectin and L-selectin, and analogues thereof 
30 having selectin activity. P-selectin preferably has the amino acid sequence 
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depicted in Figure 6A, including conservative substitions. E-selection preferably 
has the amino acid sequence in Figure 6B, including conservative substitutions. 
An "E-selectin type molecule" includes the entire E-selectin molecule as well as 
portions thereof, such as the lectin and/or epidermal growth factor (EGF)-like 
5 domains, and preferably is "E-selectin LE" as defined below. 

The "LE domains" represent the lectin and epidermal growth 
factor (EGF)-like domains of selectin, and as used herein, "P-selectin LE" 
represents the lectin and EGF-like domains of P-selectin, while "E-selectin LE" 
represents the lectin and EGF-like domains of E-selectin. The amino acid 

10 sequences for P-selectin LE and E-selectin LE are shown (underlined) in Figures 
6A and 6B, respectively, and include conservative substitutions. 

"SLe^" represents sialyl Lewis^ (SLe^ NeuNAc2,3Gall,4[Fucl,3] 
GlcNAc) tetrasaccharide, and includes "SLe^ analogues" having similar structure 
and activity as SLe^. An "SLe^ binding protein or peptide" is a protein or peptide 

15 that binds SLe^ and has an SLe^ binding site, and includes but is not limited to 
P-selectin, E-selectin, P-selectin LE and E-selectin LE. A "molecule or molecular 
complex comprising an SLe^ binding site" includes (i) P-selectin, E-selectin, P- 
selectin LE, E-selectin LE, (ii) complexes of P-selectin, E-selectin, P-selectin LE, 
or E-selectin LE with SLe^, (iii) complexes of P-selectin, E-selectin, P-selectin LE, 

20 or E-selectin LE with other molecules, and (iv) other molecules or molecular 
complexes having an SLe^ binding site. 

"PSGL-1" is a molecule having PSGL-1 activity, and includes a 
"PSGL-1 peptide" as defined below. The amino acid sequence of PSGL-1 is 
depicted in Figure 6C, and includes conservative substitutions thereof. "PSGL-1 

25 peptide" is a peptide modified by tyrosine sulfation and SLe^, and includes the 
peptide structure depicted in Figure lA (denoted as SGP-3), including 
conservative substitutions thereof. A "PSGL-1 binding protein or peptide" is a 
protein or peptide that binds to PSGL-1 and has a PSGL-1 binding site, and 
includes but is not limited to P-selectin, E-selectin, P-selectin LE and E-selectin 

30 LE. A "molecule or molecular complex comprising a PSGL-1 binding site" 
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includes (i) P-selectin, E-selectin, P-selectin LE, E-selectin LE, (ii) complexes of 
P-selectin, E-selectin, P-selectin LE, or E-selectin LE with PSGL-1, (iii) complexes 
of P-selectin, E-selectin, P-selectin LE, or E-seiectin LE with other molecules, and 
(iv) other molecules or molecular complexes having a PSGL-1 binding site. 
5 Unless otherwise indicated, "protein" or "molecule" shall include a 

protein, protein domain, polypeptide or peptide. 

"Structural coordinates" are the Cartesian coordinates 
corresponding to an atom's spatial relationship to other atoms in a molecule or 
molecular complex. Structural coordinates may be obtained using x-ray 

10 crystallography techniques or NMR techniques, or may be derived using 
molecular replacement analysis or homology modeling. Various software 
programs allow for the graphical representation of a set of structural 
coordinates to obtain a three dimensional representation of a molecule or 
molecular complex. The structural coordinates of the present invention may be 

15 modified from the original sets provided in Figures 2, 3, 4 or 5 by mathematical 
manipulation, such as by inversion or integer additions or subtractions. As such, 
it is recognized that the structural coordinates of the present invention are 
relative, and are in no way specifically limited by the actual x, y, z coordinates 
of Figures 2, 3, 4 or 5. 

20 An "agent" shall include a protein, polypeptide, peptide, nucleic 

acid, including DNA or RNA, molecule, compound or drug. 

"Root mean square deviation" is the square root of the arithmetic 
mean of the squares of the deviations from the mean, and is a way of expressing 
deviation or variation from the structural coordinates described herein. The 

25 present invention includes all embodiments comprising conservative 

substitutions of the noted amino acid residues resulting in same structural 
coordinates within the stated root mean square deviation. 

It will be obvious to the skilled practitioner that the numbering of 
the amino acid residues of P-selectin, E-selectin, P-selectin LE, E-selectin LE, 

30 PSGL-1 and the PSGL-1 peptide may be different than that set forth herein, and 
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may contain certain conservative amino acid substitutions that yield the same 
three dimensional structures as those defined by Figures 2, 3, 4 or 5 herein. 
Corresponding amino acids and conservative substitutions in other isoforms or 
analogues are easily identified by visual inspection of the relevant amino acid 
5 sequences or by using commercially available homology software programs 
(e.g., MODELLAR, MSI, San Diego, CA). 

"Conservative substitutions" are those amino acid substitutions 
which are functionally equivalent to the substituted amino acid residue, either 
by way of having similar polarity, steric arrangement, or by belonging to the 

10 same class as the substituted residue (e»g., hydrophobic, acidic or basic), and 
includes substitutions having an inconsequential effect on the three dimensional 
structures of P-selectin LE, E-selectin LE, SLe^, the PSGL-1 peptide, the P-selectin 
LE: SLe^ complex, the E-selectin LE: SLe^ complex, and the P-selectin LE: PSGL- 
1 peptide complex, with respect to the use of said structures for the 

15 identification and design of agents which interact with P-selectin, E-selectin, P- 
selectin LE, E-selectin LE, SLe^ PSGL-1, the PSGL-1 peptide, the P-selectin LE: 
SLe^ complex, the E-selectin LE: SLe^ complex, the P-selectin LE: PSGL-1 peptide 
complex, as well as other proteins, peptides, molecules or molecular complexes 
comprising an SLe^ or PSGL-1 binding site, for molecular replacement analyses 

20 and/or for homology modeling. 

An "active site" refers to a region of a molecule or molecular 
complex that, as a result of its shape and charge potential, favorably interacts or 
associates with another agent (including, without limitation, a protein, 
polypeptide, peptide, nucleic acid, including DNA or RNA, molecule, compound 

25 or drug) via various covalent and/or non-covalent binding forces. As such, an 
active site of the present invention may include, for example, the actual site of 
SLe^ or PSGL-1 binding with P-selectin LE or E-selectin LE, as well as accessory 
binding sites adjacent or proximal to the actual site of SLe^ or PSGL-1 binding 
that nonetheless may affect P-selectin LE or E-selectin LE activity upon 

30 interaction or association with a particular agent, either by direct interference 
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with the actual site of SLe^ or PSGL-1 binding or by indirectly affecting the steric 
conformation or charge potential of the P-selectin LE or E-selectin LE and 
thereby preventing or reducing binding of SLe^ or PSGL-1 to P-selectin LE or E- 
selectin LE at the actual site of SLe^ or PSGL-1 binding. As used herein, an 
5 "active site" also includes analog residues of P-selectin LE and E-selectin LE 

which exhibit observable NMR perturbations in the presence of a binding ligand, 
such as SLe^ or PSGL-1. While such residues exhibiting observable NMR 
perturbations may not necessarily be in direct contact with or immediately 
proximate to ligand binding residues, they may be critical P-selectin LE and E- 

10 selectin LE residues for rational drug design protocols. 

The present invention first provides a crystallized P-selectin LE. In 
a particular embodiment, P-selectin LE comprises the amino acid residues set 
forth in Figure 6 A (underlined), including conservative substitutions. The 
crystal of the present invention effectively diffracts X-rays for the determination 

15 of the structural coordinates of P-selectin LE, and is characterized as being in 
plate form with space group P2i, and having unit cell parameters of a=81.0A, 

o o 

b=60.8A, c=91.4A, and beta^^lOS.e"". Further, a crystallographic asjTiimetric 
unit of the crystallized P-selectin LE contains four molecules of P-selectin LE. 

The present invention also provides a crystallized complex 
20 comprising P-selectin LE and SLe"". In a particular embodiment, the amino acid 
sequence of P-selectin LE is set forth in Figure 6A (underlined), and includes 
conservative substitutions. The crystal complex of the present invention 
effectively diffracts X-rays for the determination of the structural coordinates of 
the complex of P-selectin LE and SLe'', and is characterized as being in plate 

o 

25 form with space group P2i, and having unit cell parameters of a=81.lA, 

b=60.5A, c=91.4A, and beta^lOS.S''. Further, the crystallized complex of the 
present invention consists of one molecule of the P-selectin LE: SLe"" complex in 
the asynmietric crystal unit. 

The present invention further provides a crystallized complex 

30 comprising E-selectin LE and SLe"^. In a particular embodiment, the amino acid 
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sequence of E-selectin LE is set forth in Figure 6B (underlined), and includes 
conservative substitutions. The crystal complex of the present invention 
effectively diffracts X-rays for the determination of the structural coordinates of 
the complex of E-selectin LE and SLe^ and is characterized as being in rod form 
5 with space group P2i2i2i, and having unit cell parameters of a=34.5A, 

o o 

b=72,4A, and c=77.6A. Further, the crystaUized complex of the present 
invention consists of one molecule of the E-selectin LE: SLe"" complex in the 
asymmetric crystal unit. 

The present invention still further provides a crystallized complex 

10 comprising P-selectin LE and a PSGL-1 peptide. The crystal complex of the 
present invention effectively diffracts X-rays for the determination of the 
structural coordinates of the complex of P-selectin LE and PSGL-1 peptide^ and 
is characterized as being in bipyramidal form with space group 1222 and having 
unit cell parameters of a=63.4A, b=96.8A, and c=187.3A. Further, the 

15 crystaUized complex of the present invention consists of one molecule of the PE- 
selectin LE: PSGL-1 peptide complex in the asymmetric crystal unit. 

Once a crystal or crystal complex of the present invention is 
grown. X-ray diffraction data can be collected by a variety of means in order to 
obtain the atomic coordinates of the crystallized molecule or molecular 

20 complex. With the aid of specifically designed computer software, such 

crystallographic data can be used to generate a three dimensional structure of 
the molecule or molecular complex. Various methods used to generate and 
refine the three dimensional structure of a crystaUized molecule or molecular 
structure are weU known to those skiUed in the art, and include, without 

25 limitation, multiwavelength anomalous dispersion (MAD), multiple 

isomorphous replacement, reciprocal space solvent flattening, molecular 
replacement, and single isomorphous replacement with anomalous scattering 
(SIRAS). 

Accordingly, the present invention also provides the three 
30 dimensional structure of P-selectin LE as derived by x-ray diffraction data of the 
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P-selectin LE crystal. Specifically, the three dimensional structure of P-selectin 
LE is defined by the structural coordinates shown in Figure 2, ± a root mean 
square deviation firom the backbone atoms of the amino acids of not more than 
1.5 A, preferably not more than l.OA, and most preferably not more than 0.5 A. 
5 The structural coordinates of the three dimensional structure of P-selectin LE are 
useful for a number of applications, including, but not limited to, the 
visualization, identification and characterization of various active sites of P- 
selectin LE, including the SLe^ binding site. The active site structures may then 
be used to design agents which interact with P-selectin LE, as well as P-selectin 

10 LE complexed with SLe^ PSGL-1, or related molecules. 

In addition, the present invention provides the three dimensional 
structures of P-selectin LE and SLe^ as derived by x-ray diffraction data of the P- 
selectin LE; SLe^ crystal. Specifically, the three dimensional structures of P- 
selectin LE and SLe^ are defined by the structural coordinates shown in Figure 3, 

15 ± a root mean square deviation from the backbone atoms of the amino acids of 
not more than 1.5 A, preferably not more than 1.0 A, and most preferably not 

o 

more than 0.5A. The structural coordinates of P-selectin LE and SLe^ are useful 
for a number of applications, including, but not limited to, the visualization, 
identification and characterization of various active sites of P-selectin LE, SLe^ 
20 and the P-selectin LE: SLe^ complex, including the SLe^ binding site. The active 
site structures may then be used to design agents with interact with P-selectin 
LE, SLe^, as well as P-selectin LE complexed with SLe^ PSGL-1, or related 
molecules. 

The present invention further provides the three dimensional 
25 structures of E-selectin LE and SLe^ as derived by x-ray diffraction data of the E- 
selectin LE: SLe^ crystal. Specifically, the three dimensional structures of E- 
selectin LE and SLe^ are defined by the structural coordinates shown in Figure 4, 
± a root mean square deviation from the backbone atoms of the amino acids of 
not more than 1.5A, preferably not more than l.OA, and most preferably not 
30 more than 0.5A. The structural coordinates of E-selectin LE and SLe^ are useful 
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for a number of applications, including, but not limited to, the visualization, 
identification and characterization of various active sites of E-selectin LE, SLe^ 
and the E-selectin LE: SLe^ complex, including the SLe^ binding site. The active 
site structures may then be used to design agents which interact with E-selectin 
5 LE, SLe^ as well as E-selectin LE complexed with SLe^, PSGL-1, or related 
molecules. 

Still further, the present invention provides the three dimensional 
structures of the P-selectin LE and a PSGL-1 peptide as derived by x-ray 
diffraction data of the P-selectin LE: PSGL-1 peptide crystal. Specifically, the 

10 three dimensional structures of P-selectin LE and the PSGL-1 peptide are defined 
by the structural coordinates shown in Figure 5, ± a root mean square deviation 
from the backbone atoms of the amino acids of not more than 1.5 A, preferably 
not more than 1.0 A, and most preferably not more than 0,5 A. The structural 
coordinates P-selectin LE and the PSGL-1 peptide are useful for a number of 

15 applications, including, but not limited to, the visualization, identification and 
characterization of various active sites of P-selectin LE, PSGL-1 and the P- 
selectin LE: PSGL-1 peptide complex, including PSGL-1 (and SLe^) binding sites. 
The active site structures may then be used to design agents which interact with 
P-selectin LE, PSGL-1, as well as P-selectin LE complexed with SLe^ PSGL-1, or 

20 related molecules. 

The present invention is also directed to an active site of an SLe'' 
binding protein or peptide, and preferably the SLe"" binding site of P-selectin LE, 
comprising the relative structural coordinates of amino acid residues TYR48, 
GLU80, ASN82, GLU92, TYR94, PR098, SER99, ASN105, ASP106, GLU107 and 

25 bound calcium according to Figure 3, ± a root mean square deviation from the 

o 

backbone atoms of said amino acids of not more than 1.5 A, preferably not more 

o o 

than l.OA, and most preferably not more than 0.5 A, Alternatively, the active 
site may include, in addition to the structural coordinates define above, the 
relative structural coordinates of amino acid residues TYR44, SER46, SER47, 
30 ALA77, ASP78, ASN79, PR081, ASN83, ARG85, GLU88, CYS90, ILE93, LYS96, 

163279.1 



-21- 

SER97, ALAIOO, TRP104, HIS108, LYSlll and LYS113 according to Figure 3, 
± a root mean square deviation from the backbone atoms of the amino acids of 

o o 

not more than 1.5 A, preferably not more than 1.0 A, and most preferably not 

o 

more than 0.5A. The SLe"" active site may correspond to the configuration of P- 
5 selectin LE in its state of association v\rith an agent, preferably, SLe"", or in its 
unbound state. 

The present invention is further directed to an active site of an 
SLe"" binding protein or peptide, and preferably the SLe^ binding site of E- 
selectin LE, comprising the relative structural coordinates of amino acid residues 

10 TYR48, GLU80, ASN82, ASN83, GLU92, TYR94, ARG97, GLU98, ASN105, 
ASP106, GLU107 and bound calcium according to Figure 4, ± a root mean 
square deviation from the backbone atoms of said amino acids of not more than 
1.5 A, preferably not more than 1,0A, and most preferably not more than 0.5 A. 
Alternatively, the active site may include, in addition to the structural 

15 coordinates define above, the relative structural coordinates of amino acid 
residues TYR44, SER45, PR046, SER47, ALA77, PR078, GLY79, PR081, 
GLU88, CYS90, LYS99, ASPlOO, TRP104, ARG108, LYSlll and LYS113 
according to Figure 4, ± a root mean square deviation from the backbone atoms 

o o 

of the amino acids of not more than 1.5 A, preferably not more than 1.0 A, and 

o 

20 most preferably not more than O.SA. The SLe^ active site may correspond to the 
configuration of E-selectin LE in its state of association with an agent, 
preferably, SLe^ or in its unbound state. 

Still further, the present invention provides an active site of a 
PSGL-1 binding protein or peptide, and preferably the PSGL-1 binding site of P- 

25 selectin LE, comprising the relative structural coordinates of amino acid residues 
ALA9, TYR45, SER46, SER47, TYR48, GLU80, ASN82, LYS84, ARG85, GLU88, 
GLU92, TYR94, PR098, SER99, ASN105, ASP106, GLU107, HIS108, LEUllO, 
LYSlll, LYS112, LYS113, HIS114 and bound strontium according to Figure 5, 
± a root mean square deviation from the backbone atoms of said amino acids of 

30 not more than 1.5 A, preferably not more than 1.0 A, and most preferably not 
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more than 0.5A. Alternatively, the active site may include, in addition to the 
structural coordinates define above, the relative structural coordinates of amino 
acid residues SER6, THR7, LYS8, TYRIO, SERll, TYR44, TYR49, TRP50, 
ALA77, ASP78, ASN79, PR081, ASN83, ASN86, ASN87,CYS90, ILE93, ILE95, 
5 LYS96, SER97, ALAIOO, TRP104 and CYS109 according to Figure 5, ± a root 
mean square deviation firom the backbone atoms of the amino acids of not more 

o o 

than 1.5 A, preferably not more than 1.0 A, and most preferably not more than 

o 

0.5A. The PSGL-1 binding site may correspond to the configuration of P- 
selectin LE in its state of association with an agent, and preferably, PSGL-1 or a 

10 PSGL-1 peptide, or in its unbound state. It is also within the confines of the 
present invention that strontium may be substituted with calcium for purposes 
of using the structural coordinates for drug design. 

Another aspect of the present invention is directed to a method for 
identifying an agent that interacts with a binding or active site of P-selectin LE. 

1 5 Specifically, the method comprises the steps of: (a) generating a three 

dimensional model of P-selectin LE using the relative structural coordinates 
according to Figures 2, 3 or 5, ± a root mean square deviation from the 
backbone atoms of said amino acids of not more than 1.5 A, preferably not more 

o o 

than 1.0 A, and most preferably not more than 0.5 A; and (b) employing said 
20 three-dimensional structure to design or select an agent. The agent may be 
identified using computer fitting analyses utilizing various computer software 
programs that evaluate the "fit" between the putative active site and the 
identified agent, by (a) generating a three dimensional model of the putative 
active site of a molecule or molecular complex using homology modeling or the 
25 atomic structural coordinates of the active site, and (b) determining the degree 
of association between the putative active site and the identified agent. Three 
dimensional models of the putative active site may be generated using any one 
of a number of methods known in the art, and include, but are not limited to, 
homology modeling as well as computer analysis of raw data generated using 
30 crystallographic or spectroscopy data. Computer programs used to generate such 
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three dimensional models and/or perform the necessary fitting analyses include, 
but are not limited to: GRID (Oxford University, Oxford, UK), MCSS (Molecular 
Simulations, San Diego, CA), AUTODOCK (Scripps Research Institute, La JoUa, 
CA), DOCK (University of California, San Francisco, CA), Flo99 (Thistiesoft, 
5 Morris Township, NJ), Ludi (Molecular Simulations, San Diego, CA), QUANTA 
(Molecular Simulations, San Diego, CA), Insight (Molecular Simulations, San 
Diego, CA), SYBYL (TRIPOS, Inc., St. Louis. MO) and LEAPFROG (TRIPOS, Inc., 
St. Louis, MO). 

The effect of such an agent identified by computer fitting analyses 

10 on P-selectin LE activity may be further evaluated by contacting the identified 
agent with P-selectin LE and measuring the effect of the agent on P-selectin LE 
activity. Depending upon the action of the agent on the active site of P-selectin 
LE, the agent may act either as an inhibitor or activator of P-selectin LE activity. 
For example, enzjnnatic assays may be performed and the results analyzed to 

15 determine whether the agent is an inhibitor of P-selectin LE and SLe"" (i.e., the 
agent may reduce or prevent binding affinity between P-selectin LE and SLe'') or 
an activator of P-selectin LE and SLe'' (i.e., the agent may increase binding 
affinity between P-selectin and SLe""). Further tests may be performed to 
evaluate the potential therapeutic efficacy of the identified agent on conditions 

20 associated with selectins such as inflammation. 

The present invention is not limited to identifying agents which 
interact with an active site of P-selectin LE, but also is directed to a method for 
identifying an activator or inhibitor of any molecule or molecular complex 
comprising an SLe'' binding site or a PSGL-1 binding site, including but not 

25 Umited to P-selectin, E-selectin, E-selectin LE, P-selectin LE: SLe"" complex, and 
E-selectin LE: SLe"" complex. 

In one embodiment, the present invention provides a method for 
identifying an activator or inhibitor of a molecule or molecular complex 
comprising an SLe"" binding site, comprising the steps of: (a) generating a three 

30 dimensional model of said molecule or molecular complex comprising an SLe'' 
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binding site using (i) the relative structural coordinates according to Figure 3 of 
residues TYR48, GLU80, ASN82, GLU92, TYR94, PR098, SER99, ASN105, 
ASP106, GLU107 and bound calcium, ± a root mean square deviation from the 

o 

backbone atoms of said amino acids of not more than 1.5 A, preferably not more 
5 than l.OA, and most preferably not more than 0.5A or (ii) the relative 
structural coordinates according to Figure 4 of amino acid residues TYR48, 
GLU80, ASN82, GLU92, TYR94, ARG97, GLU98, ASN105, ASP106, GLU107 and 
bound calcium, ± a root mean square deviation from the backbone atoms of 

o o 

said amino acids of not more than 1.5 A, preferably not more than 1.0 A, and 
10 most preferably not more than 0.5A; and (b) selecting or designing a candidate 
activator or inhibitor by performing computer fitting analysis with the three 
dimensional model generated in step (a). In another embodiment, the relative 
structural coordinates according to Figure 3 further comprises amino acid 
residues TYR44, SER46, SER47, ALA77, ASP78, ASN79, PR081, ASN83, 
15 ARG85, GLU88, CYS90, ILE93, LYS96, SER97, ALAIOO, TRP104, HIS108, 

LYSlll and LYS113, ± a root mean square deviation from the backbone atoms 

o o 

of said amino acids of not more than 1.5 A, preferably not more than 1.0 A, and 
most preferably not more than O.sA. In yet another embodiment, the relative 
structural coordinates according to Figure 4 further comprises the amino acid 
20 residues TYR44, SER45, PR046, SER47, ALA77, PR078, GLY79, PR081, 

GLU88, CYS90, LYS99, ASPlOO, TRP104, ARG108, LYSlll and LYS113, ± a 
root mean square deviation from the backbone atoms of said amino acids of not 
more than 1.5 A, preferably not more than 1.0 A, and most preferably not more 

o 

than 0.5A. Once the candidate activator or inhibitor is obtained or synthesized, 
25 the candidate activator or inhibitor may be contacted with the molecule or 
molecular complex, and the effect the candidate activator or inhibitor has on 
said molecule or molecular complex may be determined. Preferably, the 
candidate activator or inhibitor is contacted with the molecule or molecule 
complex in the presence of SLe"" (or a molecule or molecular complex 
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comprising SLe"") in order to determine the effect the candidate activator or 
inhibitor has on binding of the molecule or molecular complex to SLe"". 

In yet another embodiment, the present invention provides a 
method for identifying an activator or inhibitor of a molecule or molecular 
5 complex comprising a PSGL-1 binding site, comprising the steps of: (a) 

generating a three dimensional model of said molecule or molecular complex 
comprising a PSGL-1 binding site using the relative structural coordinates 
according to Figure 5 of amino acid residues ALA9, TYR45, SER46, SER47, 
TYR48, GLU80, ASN82, LYS84, ARG85, GLU88, GLU92, TYR94, PR098, SER99, 
10 ASN105, ASP106, GLU107, HIS108, LEUllO, LYSlll, LYS112, LYS113, HIS114 
and bound strontium, ± a root mean square deviation from the backbone atoms 

o o 

of said amino acids of not more than 1.5 A, preferably not more than 1.0 A, and 

o 

most preferably not more than 0.5 A; and (b) selecting or designing a candidate 
activator or inhibitor by performing computer fitting analysis with the three 
15 dimensional model generated in step (a). In another embodiment, the relative 
structural coordinates according to Figure 5 further comprises amino acid 
residues SER6, THR7, LYS8, TYRIO, SERll, TYR44, TYR49, TRP50, ALA77, 
ASP78, ASN79, PR081, ASN83, ASN86, ASN87,CYS90, ILE93, ILE95, LYS96, 
SER97, ALAIOO, TRP104 and CYS109, ± a root mean square deviation from the 

o 

20 backbone atoms of said amino acids of not more than 1.5 A, preferably not more 

o o 

than 1.0 A, and most preferably not more than 0.5 A. Once the candidate 
activator or inhibitor is obtained or synthesized, the candidate activator or 
inhibitor may be contacted with the molecule or molecular complex, and the 
effect the candidate activator or inhibitor has on said molecule or molecular 
25 complex may be determined. Preferably, the candidate activator or inhibitor is 
contacted with the molecule or molecule complex in the presence of PSGL-1 or a 
PSGL-1 peptide in order to determine the effect the candidate activator or 
inhibitor has on binding of the molecule or molecular complex to PSGL-1 or the 
PSGL-1 peptide. Here again, it is also within the confines of the present 
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invention that strontium may be substituted with calcium for purposes of using 
the structural coordinates for drug design. 

In addition, the structural coordinates of SLe'' as set forth in 
Figures 3 or 4, and the structural coordinates of the PSGL-1 peptide set forth in 
5 Figure 5 can be used for identifying or designing agents which interact with SLe"" 
and PSGL-1, respectively. In this regard, the present invention provides a 
method for identifying an agent that interacts with SLe'', comprising the steps of: 
(a) generating a three dimensional model of SLe"" using the relative structural 
coordinates according to Figures 3 or 4, ± a root mean square deviation from 

o 

10 the backbone atoms of said amino acids of not more than 1.5 A, preferably not 

o o 

more than l.OA, and most preferably not more than 0.5 A; and (b) employing 
said three-dimensional structure to design or select an agent that interacts with 
SLe"". The identified agent can then be synthesized or obtained, and then 
contacted with SLe'' (or a molecule or molecular complex comprising SLe"") to 
15 determine the effect the agent has on SLe"" activity. 

Still further, the present invention provides a method for 
identifying an agent that interacts with PSGL-1, comprising the steps of: (a) 
generating a three dimensional model of a PSGL-1 peptide using the relative 
structural coordinates according to Figure 5, ± a root mean square deviation 

o 

20 from the backbone atoms of said amino acids of not more than 1.5 A, preferably 

o o 

not more than l.OA, and most preferably not more than 0.5 A; and (b) 
emplojdng said three-dimensional structure to design or select an agent that 
interacts with PSGL-1. The identified agent can then be synthesized or 
obtained, and then contacted with PSGL-1 or the PSGL-lpeptide (or a molecule 

25 or molecular complex comprising PSGL-1 or the PSGL-1 peptide) to determine 
the effect the agent has on PSGL-1 or the PSGL-1 peptide activity. 

Various molecular analysis and rational drug design techniques 
are further disclosed in U.S. Patent Nos. 5,834,228, 5,939,528 and 5,865,116, 
as well as in PCT AppUcation No. PCT/US98/16879, published WO 99/09148, 

30 the contents of which are hereby incorporated by reference. 
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The present invention is also directed to the agents, activators or 
inhibitors identified using the foregoing methods. Such agents, activators or 
inhibitors may be a protein, polypeptide, peptide, nucleic acid, including DNA or 
RNA, molecule, compound, or drug. Small molecules or other agents v^hich 
5 inhibit or otherwise interfere with the selectin-mediated cellular rolling of 
leukocytes over vascular tissue may be useful in the treatment of diseases 
involving abnormal inflammatory responses such as asthma and psoriasis. 

In addition, the present invention is directed to a method for 
determining the three dimensional structure of a molecule or molecular complex 

10 whose structure is unknown, comprising the steps of obtaining crystals of the 
molecule or molecular complex whose structure is unknown and generating X- 
ray diffraction data from the crystallized molecule or molecular complex. The 
X-ray diffraction data from the molecule or molecular complex is then compared 
with the known three dimensional structure determined from any of the 

15 aforementioned crystals of the present invention. Then, the known three 

dimensional structure determined from the crystals of the present invention is 
"conformed" using molecular replacement analysis to the X-ray diffraction data 
from the crystallized molecule or molecular complex. Alternatively, 
spectroscopic data or homology modeling may be used to generate a putative 

20 three dimensional structure for the molecule or molecular complex, and the 
putative structure is refined by conformation to the known three dimensional 
structure determined from any of the crystals of the present invention. 

Finally, the present invention provides a method for obtaining a 
crystaUized complex of an E-selectin type molecule and a compound that 

25 coordinates calcium such as SLe''. The method comprises the steps of: (a) 
contacting a crystallized E-selectin type molecule with a compound that 
coordinates calcium in the presence of calcium ions and PEG to form a 
crystallized complex of the E-selectin type molecule and the compound that 
coordinates calcium; and (b) contacting the crystallized complex in the presence 

30 of a reduced concentration of calcium ions, and sufficient concentrations of PEG 
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and an ionic salt to obtain a final crystallized complex, that upon cooling, is 
suitable for elucidating the three dimensional structures of the E-selectin tj^e 
molecule and the compound that coordinates calcium by x-ray diffraction of the 
final crystallized complex. 
5 In the method, the "E-selectin type molecule" may be the entire E- 

selectin molecule as well as portions thereof, such as the lectin and/or 
epidermal growth factor (EGF)-Uke domains, and preferably is E-selectin LE. 
The compound that coordinates calcium is preferably SLe"^. In step (a), the 
crystallized E-selectin type molecule may be prepared by procedures known in 

10 the art. When the crystallized E-selection type molecule is E-selectin LE, the 
crystallized E-selectin LE is preferably prepared as described in Example 1 
below. The source of calcium ions in the method is preferably CaCl2, while the 
PEG is preferably PEG-1000 to PEG-20,000, and most preferably PEG-4000. 
The ionic salt may be a number of ionic salts known in the art, and preferably is 

15 NaCl. 

An important aspect of the method is the reduction of calcium ions 
in step (b), or the use of an effective calcium concentration that, prevents or 
reduces the affect of calcium in inhibiting binding of the compound that 
coordinates calcium to the E-selectin type molecule, thereby permitting the 

20 formation of a final crystal complex that a suitable for elucidating the three 
dimensional structures of the E-selectin type molecule and the compound that 
coordinates calcium by x-ray diffraction of the final crystallized complex. That 
is, the concentration of calcium should be sufficiently low to permit the 
compound that coordinates calcium to bind to its binding site on the E-selectin 

25 type molecule. For example, if the compound that coordinates calcium is SLe^ 
then it is envisioned that the concentration of calcium ions in step (a) may 
range from 20 mM to 300 mM, while the concentration of calcium ions in step 
(b) is lower (i.e. in the range firom 100 \xM to 20 mM). In steps (a) and(b), 
PEG should be at a concentration sufficient to maintain the integrity of the 

30 crystal in view of the reduced or lower concentration of calcium, and is 
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preferably about 15% to 60% (w/v) PEG. The concentration of the ionic salt in 
step (b) is about 10 mM to 500 mM. The contacting in step (a) may be affected 
for about 10-20 hours, and preferably about 15 hours when the compound that 
coordinates crystal is SLe''. The contacting in step (b) is affected for about 0.5-3 
5 hours, and preferably for about 1 hour. 

It is also within the confines of the present invention that steps (a) 
and (b) can be combined in which case the crystallized E-selectin type molecule 
is contacted with a compound that coordinates calcium in the presence of 
sufficient concentrations of calcium ions, PEG and an ionic salt to form a 

10 crystallized complex (of the E-selectin type molecule and the compound that 
coordinates calcium) that is suitable for elucidating the three dimensional 
structures of the E-selectin type molecule and the compound that coordinates 
calcium by x-ray diffraction of the final crystallized complex. If steps (a) and 
(b) are combined, then it is envisioned that the concentrations of calcium ions, 

15 PEG and the ionic salt are about 100 |JlM to 20 mM, 15% to 60% (w/v) and 10 
mM to 500 mM, respectively. 

The present invention may be better understood by reference to 
the following non-limiting Example. The following Example is presented in 
order to more fully illustrate the preferred embodiments of the invention, and 

20 should in no way be construed as limiting the scope of the present invention. 

Example 1 

1. Experimental Procedures 

Generation of Constructs and Protein/Peptide Preparation . The lectin-EGF (LE) 
25 domains (153 amino acids) of P-selectin (P-LE) and E-selectin (E-LE) fused to 
the CH2-CH3 region of IgGj via an intervening enterokinase cleavage sequence 
(Asp-Asp-Asp-Asp-Lys) were expressed in CHO cells and recovered from 
conditioned media by protein A sepharose (Pharmacia) chromatography. 
Monomeric selectin LE domains were produced by digestion of the dimeric Fc 
30 constructs with enterokinase (LaVallie et al., 1993) and the enzyme and residual 
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Fc domains were removed by chromatography over tandem soy bean trypsin 
inhibitor-agarose (Sigma) and protein A (Perseptive Biosystems) columns. The 
selectin LE domains were deglycosylated at 37° C for 48 hrs at a ratio of 25 
milliunits N-glycanase/mg protein and purified by anion exchange and 
5 hydrophobic interaction chromatography. LE domains were brought to 10-30 
mg/ml by vacuum concentration. Both P-LE and E-LE were determined to be 
correct by mass spectrometry (MS), monomeric by gel filtration HPLC, and 
functional by surface plasmon resonance (BIAcore) analysis (see below) . 

A soluble construct (19ek.Fc) containing the N-terminal 19 amino 

10 acids of PSGL-1 fused to the Fc region of IgG^ via a nine amino acid linker 

containing the enterokinase cleavage sequence was described earlier (Goetz et 
al., 1997). 19ek.Fc was purified, digested with enterokinase, and the 
monomeric PSGL-1 19ek peptides were recovered as in the above for the LE 
constructs. The heterogenous 19ek peptides were purified to individual species 

15 by SuperQ anion-exchange chromatography using a gradient of 0-500 mM NaCl. 
Their structures were determined by MS before and after proteolytic and 
glycosidic digestions, NMR, and by composition analyses (J. Rouse, D. Tsao, and 
R. Camphausen, unpublished data). 

20 P-LE/19ek Peptide Binding Studies . Surface plasmon resonance was performed 
on BIAcore 2000 and 3000 instruments at 25' C using streptavidin-coated 
sensor chips (BIAcore) and HBS-P buffer (BIAcore; 10 mM HEPES (pH 7.4), 150 
mM NaCl, and 0.005% polysorbate 20 (v/v)) adjusted to 1 mM each CaCls and 
MgCl2. The 19ek peptides and sPSGL, a fucosyltransferase-VII modified version 

25 of the entire dimeric extracellular domain of PSGL-1 (Croce et al., 1998), were 
biotinylated at Lys residues with Sulfo-NHS-LC-Biotin (Pierce). Following 
biotinylation, sPSGL was reacted with immobilized P-selectin in order to isolate 
functional material (Sako et al., 1995). A sjmthetic peptide (AnaSpec, Inc.), 
corresponding to the polypeptide portion of SGP-3 was similarly biotinylated. 

30 Biotinylated reagents were coated onto sensor chips using HBS-P buffer. 
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Glycosylated P-LE and E-LE, quantitated by experimentally determined 
extinction coefficients (280 nm), were injected over thel9ek peptide, sPSGL and 
control surfaces at 40 |uLl/min. The specificity of binding was validated by 
control experiments performed in the presence of neutralizing Mabs to P- and E- 
5 selectin, 10 mM EDTA, and soluble 19ek peptides (R. Camphausen, unpublished 
data). Deglycosylated P-LE and E-LE bound comparably to the intact, 
glycosylated versions. 

Crystallization and Data Collection . All diffraction data were collected in-house 

10 on Rigaku RU200 generators running at 5.0 KW, with Yale/Molecular Structure 
Corp. focusing mirrors and RAXIS II or RAXIS IV image plate area detectors 
except where noted. 

Plate shaped crystals of P-LE were grown at IS'^C using vapor 
diffusion from a solution containing 10 mg/ml protein, 100 mM Tris-HCl (pH 

15 8.5), 150 mM NaCl, 12 mM CaCls, 10% (v/v) 2,4 methyl pentane diol (MPD), 
and 10% (w/v) PEG 6000. These crystals were transferred into 100 mM Tris- 
HCl (pH 8.5), 75 mM NaCl, 10 mM CaCls, 10% (v/v) MPD, and 11% (w/v) 
PEG 6000, then transferred for two hours to the same buffer diluted by 5% 
(v/v) with MPD. After a second 5% dilution with MPD the crystal was soaked 

20 for a 13 hours prior to flash cooling in liquid propane held at liquid nitrogen 
temperature. P-LE complexed with SLe^ (SLe^-p-O-methyl, Toronto Research 
Chemicals) was obtained using the same methods but with the addition of 8 mM 
SLe^ to the final soaking solution. The space group of the P-LE crystals was P2i 
with cell parameters a=81.0 A, b=60.8 A, c=91.4 A, and beta=103.6\ 

25 Soaking crystals in SLe^ reduced the maximum resolution to 3.4 A, increased the 
mosaicity to 1.5° and gave cell parameters a=81.1 A, b=60.5 A, c=91.4 A and 
beta= 103.3 \ Diffraction data were processed and scaled with 
DENZO/SCALEPACK (HRL Research, Inc.) giving the statistics reported in Table 
1. 
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Large, rod shaped crystals of E-LE were obtained using vapor 
diffusion at 18°C from a solution containing 30 mg/ml protein, 100 mM HEPES 
(pH 7.5), 10 mM Tris-HCl, 200 mM CaClaand 15% (w/v) PEG 4000. Crystals 
took several weeks to grow and were more reproducible with the use of 
5 macroseeding. For complexes with SLe^, E-LE crystals were transferred to a 
solution containing 100 mM HEPES (pH 7.5), 200 mM CaCls, 30% (w/v) PEG 
4000, and 15 mM SLe^ for 15 hours at 25 ''C. After this initial incubation, 
crystals were transferred into 100 mM Tris-HCl (pH 7.4), 300 mM NaCl, 2 mM 
CaCl2, 30% (w/v) PEG 4000, and 15 mM SLe"" for an additional hour prior to 

10 flash cooling as described above. The E-LE/ SLe^ crystals belonged to space 
group P2i2i2i with cell parameters a=34.5 A, b=72.4 A, and c=77.6 A. 
Diffraction data were processed as in above. 

Large crystals of the P-LE/PSGL-1 19ek peptide (SGP-3) complex 
measuring up to 0.5 x 0.5 x 0.3 mm were obtained by repeat macroseeding into 

15 vapor diffusion crystallization drops. The crystals grew from 8 mg/ml P-LE, 

SGP-3 in a 2 fold molar excess, 10 mM Tris-HCl, 100 mM HEPES (pH 7.0), 150 
mM NaCl, 4 mM CaCla, 50 mM SrCls, 5% (w/v) PEG 6000, and 33% (v/v) MPD. 
Earlier small seed crystals were obtained from a similar buffer containing 50% 
MPD but with no SrCl2 or PEG 6000. Crystals were transferred into 100 mM 

20 HEPES pH 7.0, 10% PEG 6000, 30% MPD, 100 mM NaCl, and 50 mM SrCls for 
15 hours prior to flash cooling. A mercury-derivatized crystal was obtained by 
adding 0.5 mM mercury acetate to the final soaking buffer for 24 hours prior to 
flash cooling. The space group of the complex crystals was found to be 1222 
with cell parameters a=63.4 A, b=96.8 A, and c= 187.3 A. The crystals were in 

25 bipyramidal form. Native diffraction data were collected at Brookhaven 
National Labs station X4A using an RAXIS IV to record diffraction data. The 
mercury derivative data were collected in-house. All data were reduced as 
described above giving statistics in Table 1. 
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Structure Determination and Refinement . Crystals of E-LE complexed with SLe^ 
were found to be essentially isomorphous with those reported earlier (Graves, et 
al., 1994) with a single copy of E-LE in the crystallographic asymmetric unit. 
Rigid body refinement within CNS (Brunger et al., 1998) was used to obtain 
5 initial phases that gave clear electron density for the bound SLe^. These maps 
were further improved with the use of BUSTER (Bricogne, 1993) allowing a 
model for bound SLe^ to be fitted using QUANTA (Molecular Simulations, Inc.). 
All further refinement was performed in CNS giving a final model with 86% of 
residues in the most favored region of the Ramachandran plot and consisting of 

10 total 1510 atoms, 186 water molecules, one calcium ion, and one copy of SLe^. 
Statistics are described in Table 1. 

The structure of P-LE was solved with molecular replacement 
using the pubUshed model of the E-selectin lec/EGF construct (PDB accession 
code lESL). The program AMORE (CCP4, 1994) was used to locate all four 

15 copies of P-LE. The model was built and refined using the methods described 
above. One copy of P-LE had poorer electron density than the other three but 
with the use of non-crystallographic sjmimetry the refinement progressed well 
giving the statistics in Table 1. At the end of refinement the fourth copy of P-LE 
had an average B-factor of 63.7 compared to 44.2 for the other three 

20 copies. The final model consists of four copies of P-LE with 81% of the residues 
in the most favored regions of the Ramachandran plot, 5418 total atoms, 134 
water molecules, 2 MPD molecules and 4 calcium ions. 

The crystals of P-LE soaked in SLe^ were essentially isomorphous 
with the P-LE structure described above. After rigid body refinement in CNS 

25 there was clear density for three bound SLe^ molecules. The fourth binding site 
was partially occluded by crystal contacts, thus explaining the loss of resolution 
upon soaking in SLe^. QUANTA was used to model bound SLe^ and refit protein 
residues before Hmited refinement in CNS. This gave a final model with 71% of 
the residues in the most favored Ramachandran regions, 5455 total atoms, 3 

30 SLe^ molecules, 4 calcium ions and 2 MPD molecules. 
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All attempts to solve the structure of P-LE complexed with the 
PSGL-1 19ek peptide SGP-3 by molecular replacement failed. The heavy atom 
site was located with Patterson techniques refined in SHARP (de la Fourtelle 
and Bricogne, 1997). These phases gave poor maps but of sufficient quality to 
5 allow positioning of two lectin domains from the P-LE structure and 

independent positioning of two EGF domains. These maps also indicated the 
presence of bound SLe^ molecules in the same position as found in the P- 
LE/SLe^ complex and extra density that which could not be interpreted. The 
heavy atom phases were combined with model phases in BUSTER giving clear 

10 maximum entropy maps that allowed fitting of the SGP-3 polypeptide and SLe^ 
moieties in QUANTA. This structure was refined as described above giving 
statistics in Table 1 and 84% of residues in the most favored regions of the 
Ramachandran plot. The final model consists of two P-LE/SGP-3 complexes, 
3263 atoms, 2 strontium ions, 224 water molecules, 2 sodium ions, and 7 bound 

15 MPD molecules. 

2. Results and Discussion 

X-rav Crvstal Structure of the P-selectin Lectin/EGF Domains . We generated a 
P-selectin construct containing the N-terminal lectin and EGF domains (termed 

20 P-LE). While the E-selectin lec/EGF crystal structure (Graves et al., 1994) 

indicates that there is minimal interaction between the lectin and EGF domains 
and the putative SLe^ binding site is well removed from the lectin-EGF domain 
junction, we elected to retain the EGF domain because of studies suggesting it 
may have a functional role (Gibson et al., 1995; Kansas et al., 1994). P-LE was 

25 fused to the Fc region of IgGj via an intervening enterokinase cleavage 

sequence. This construct, termed P-LE.Fc, allowed for facile purification from 
conditioned media and the generation of monomeric P-LE domains following 
enterokinase digestion. P-LE expressed in CHO cells contains three N-linked 
glycans, which were enzymatically removed prior to crystallization. P-LE crystals 

30 were obtained in space group P2i with 4 molecules in the crystallographic 
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asymmetric unit and diffracted to 2.4 A resolution. The structure was solved by 
molecular replacement using the E-selectin lec/EGF crystal structure coordinates 
as a search model. 

The P-LE crystal structure adopts an overall conformation 
essentially identical to that of E-selectin lec/EGF structure. This is consistent 
with the 62% sequence identity between the selectins in these domains and 
results in a RMS difference of only 0.7 A for their C backbones. The lectin and 
EGF domains of P-LE interact via a small interface making the similarity with 
the E-selectin construct even more striking since this relationship is also 
maintained. The movement of several loops in the EGF domain together with a 
small movement of the interdomain angle is responsible for many of the minor 
differences between the two structures. The interdomain angle also varies 
slightly between different copies of P-LE in the crystal and therefore is likely an 
effect of crystal packing forces. 

The putative SLe^ binding site, suggested by mutagenesis and 
structure studies (Kansas, 1996), is remarkably conserved between P-LE and the 
E-selectin lec/EGF structure. The common feature of this site, and presumably 
the basis for the metal dependency of selectin function, is a calcium ion 
coordinated by the side chains of Glu-80, Asn-82, Asn-105 and Asp-106 and the 
backbone carbonyl of Asp-106. Two water molecules also ligate calcium within 
P-LE, one of which is stabilized by the side-chain of Asn-83, The similarity of the 
SLe^ binding sites is even more striking considering that the eight bound waters 
in this area of P-LE are also present (within 0.8 A) in the E-selectin lec/EGF 
structure (not shown). The binding sites do however have differences in one 
area defined largely by the change of Arg-97 in E-selectin to Ser-97 in P-selectin. 
In the E-selectin lec/EGF structure, Arg-97 stacks on Tyr-94 thereby presenting a 
positively charged surface in this region whereas in P-LE Tyr-94 is not 
obstructed by the smaller Ser-97 residue and thereby has the potential to 
mediate hydrophobic interactions. Arg-97 in the E-selectin lec/EGF construct 
makes a hydrogen bond with Asp-lOO, which is an alanine residue in P-LE. 
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Adjacent to this region is Lys-99 which, within E-selectin lec/EGF, points away 
from the binding site. The equivalent residue in P-LE is a serine residue that 
faces into the binding site. 

Crystal Structures of P- and E-selectin Lec/EGF Domains Complexed with SLe^ . 
We sought to obtain the crystal structure of P-LE complexed with SLe^, which 
we anticipated could be obtained by soaking SLe^ into preformed P-LE crystals. 
We also wished to understand the structural basis for the approximately 10-fold 
higher SLe^ binding affinity described for E-selectin relative P-selectin (Poppe et 
al., 1997), which is unexpected given the similarity of their putative SLe^ 
binding sites. Therefore, we produced an E-selectin lectin/EGF construct (E-LE) 
which was expressed, purified and deglycosylated as for P-LE. E-LE crystallized 
under conditions similar to those employed earlier for E-selectin lec/EGF 
(Graves et al., 1994) with one copy of E-LE in the crystallographic as5mimetric 
unit. 

P-LE crystals were soaked in a stabilizing solution containing 8 
mM SLe^ and crystallographic data was collected that extended to 3.4 A (Table 
1). The P-LE/SLe^ crystal structure has four copies of P-LE in the asymmetric 
unit, with one SLe^ binding site partially occluded by crystal contacts. It is 
presumably this crystal contact that causes such a dramatic loss in diffraction 
quality upon soaking. Despite the low resolution of the data, clear electron 
density could be seen extending away from the bound calcium in three copies of 
P-LE. These maps were used to construct the complex structure of SLe^ bound to 
P-LE although these results were later refined after obtaining a high-resolution 
SLe"" complex with E-LE. Initial attempts to soak SLe^ into E-LE crystals at 
concentrations of SLe^ up to 20 mM were unsuccessful. We subsequently 
determined that the high concentration of calcium used for crystallization 
inhibited E-selectin in SLe^ binding assays (not shown). Therefore, a soaking 
protocol was derived wherein the calcium concentration was reduced to a level 
which facilitated SLe^ binding and still maintained high-resolution diffraction. 
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These conditions resulted in crystals that gave diffraction data to 1.5 A 
resolution and clear electron density for bound SLe^ (Table 1). 
The structure of the P-LE/SLe^ complex reveals that the interactions are almost 
entirely electrostatic in nature and the total buried surface area is small (549 A^) 
5 when compared to the size of SLe^. Since the complex with P-LE is low 
resolution a more detailed description of the conserved interactions will be 
described below for the E-LE/SLe^ complex. The interactions of the Fuc 
hydroxyls must provide a large amount of the binding energy. The 3- and 4- 
hydroxyl groups not only coordinate the bound calcium but also form hydrogen 

10 bonds with residues that are themselves coordinating the calcium. In addition, 
Glu-107 is only 3.4 A away from the Fuc 2-hydroxyl group and may form a 
weak hydrogen bond. The SLe^ Gal residue hydrogen bonds with protein 
residues using the 4-hydroxyl group (with the hydroxyl group of Tyr-94) and 
the 6-hydroxyl group (with the carboxylate group of Glu-92). The NeuNAc 

15 residue interacts in the region where the two selectins are very different, the 
Arg-99 (E-selectin) versus Ser-99 (P-selectin) site. In P-LE, the hydroxyl groups 
of Tyr-48 and Ser-99 form hydrogen bonds to the carboxylate moiety and 4- 
hydroxyl group, respectively, of NeuNAc. Finally, it appears that C-4 of the 
NeuNAc ring packs against Pro-98. The positioning of the NeuNAc would make 

20 unfavorable contacts with Arg-99 in E-LE and so moves further back (Figure 2A) 
to allow for better interactions. The rest of SLe^ is essentially superimposible 
between E-LE and P-LE. 

The structure of E-LE complexed with SLe^ confirms the 
interactions seen in the low-resolution structure of P-LE with different contacts 

25 to the NeuNAc as a result of the Ser-97:Arg-97 (P-selectin: E-selectin) difference. 
Fuc 3- and 4-hydroxyl groups coordinate the bound calcium and form a complex 
network of hydrogen bonds with residues that also coordinate the calcium. The 
Fuc 4-hydroxyl group replaces exactly a calcium-ligated water molecule 
observed in the unliganded structure, accepts a hydrogen bond from Asn-82 and 

30 donates a hydrogen bond to Glu-80. The Fuc 3-hydroxyl group displaces 
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another calcium coordinated water molecule although its final position is now 
one A closer to Asn-105. Upon SLe^ binding Asn-83 rotates its 2 torsion angle to 
59° so that it can now donate a hydrogen bond to a bound water that in turn 
hydrogen bonds to both Fuc 2-hydroxyl group and the side chain of Glu-107 
(not shown). This rotation now also allows the Asn-83 side chain to coordinate 
the calcium. The replacement of Ser-97 in P-LE with Arg-97 in E-LE allows the 
formation of a different set of interactions with the NeuNAc and causes a 
movement of the sugar away firom the side chain to alleviate close contacts that 
would occur. The Arg-97 donates hydrogen bonds to the glycosidic oxygen and 
the carboxylate group of NeuNAc. In a similar arrangement to P-LE, the 
carboxylate group also accepts a hydrogen bond from Tyr-48. 

The protein contacts observed for SLe^ binding to P-LE and E-LE 
are in excellent agreement with site directed mutagenesis studies focused on 
defining residues important for P- and E-selectin recognition (Erbe et al., 1993; 
Erbe et al., 1992; Graves et al., 1994; Hollenbaugh et al., 1993; Ng and Weis, 
1997). Many of these mutations (e.g., substitutions of Tyr-48 and Tyr-94 in E- 
and P-selectin, and Arg-97 in E-selectin) can now be interpreted to directly 
affect hydrogen bonding interactions with SLe^. Others mutations likely disrupt 
function indirectiy by altering the orientation of interacting residues. It is this 
latter explanation which likely accounts for the disruption of function associated 
with mutations of a triple Lys stretch (residues 111-113) found in both E- and P- 
selectin. In the structures of P-LE and E-LE complexed with SLe^ Lys-113 does 
not participate directiy in binding. However, this residues does hydrogen bond 
(via it side-chain amino group) to the carboxylate moiety of Glu-92 which also 
hydrogen bonds the 6-hydroxyl group of Gal residue within SLe^, Hence, some 
mutations in this region of the selectins may disrupt SLe^ binding by an indirect 
mechanism. 

Comparing the E- and P-LE/SLe^ structures to structures of related 
lectin/glycan complexes reveals important similarities and differences. 
Examination of the crystal structure of MBP-A bound to oligomannose (Weis et 
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al., 1992) indicates a similar arrangement of calcium-binding interactions also 
involving the 3- and 4-hydroxyl groups of the ligating mannose residue. 
However, the Fuc ring in the E- and P-LE/SLe^ structures is "flipped" relative to 
mannose in the MBP-A/oligomannose complex. This results in the swapping of 
ring positions so that the Fuc 3- and 4-hydroxyl groups of the selectin LE/SLe^ 
complexes occupy the 4- and 3-hydroxyl group positions, respectively, of 
mannose in the MBP-A/oligomannose complex. Even though different hydroxyl 
groups are used and their relationship to the sugar ring differs (equatorial- 
equatorial for the mannose 3- and 4-hydroxyl groups, respectively, versus 
equatorial-axial in Fuc), the vectors along the sugar ring carbons to hydroxyl 
groups are maintained. This precise positioning of hydroxyl groups appears to 
be essential for simultaneous ligation to the calcium ion and hydrogen bond 
interactions with the protein. The crystal structure of the K3 mutant of MBP-A in 
which three selectin residues have been introduced (Ng and Weis, 1997) binds 
SLe^ significantiy differentiy than we observe here for the selectin LE/SLe^ 
complexes. While this structure and the structures presented here show Fuc 
ligation to the bound calcium, different hydroxyl groups are involved. In the K3 
mutant/SLe^ complex, Fuc 2- and 3-hydroxyl groups Hgate calcium and occupy 
the Fuc 4- and 3-hydroxyl group positions, respectively, found in the selectin 
LE/SLe^ complexes. This results in a 90' rotation of the SLe^ orientation within 
the binding pocket relative to the selectin LE/SLe^ complexes and affords a 
hydrogen bond interaction between the Gal 4-hydroxyl group and the side chain 
of Lys-111, which we do not observe. This highlights the importance of Glu-92, 
Tyr-94, and Tyr-48 (and Arg-97 in E-selectin) to the binding of the ligand in the 
E- and P-LE/SLe^ complexes. Finally, models of SLe^ molecularly docked into 
the E-selectin lec/EGF crystal structure (Graves et al,, 1994; Kogan et al., 1995; 
Poppe et al., 1997) compare favorably to our results in terms of the general 
orientation of SLe^ on the binding surface. However all disagree to varying 
degrees with the structures shown here with regard to the identity of the 
molecular contacts. With the underlying assumption that ligation of SLe^ to the 
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bound calcium mimics that which is observed in the MBP-A/oligomannose 
complex and in the MBP-A K3 mutant/SLe^ complex, all models propose that 
the Fuc 2- and 3-hydroxyl groups of SLe^ ligate the bound calcium. This is in 
sharp contrast to our observations of two separate selectin LE/SLe^ complexes in 
5 which Fuc ligation is mediated via the 3- and 4-hydroxyl groups. Other proposed 
contacts for the E-selectin/SLe^ interaction are consistent or inconsistent with 
our results to varying degrees. The design of SLe^ mimetics intended for 
therapeutic purposes based upon these incorrect structural considerations may 
ultimately limit the success of these efforts. 

10 

Generation of a Functional PSGL-1 Peptide for X-rav Crystallography . We next 
sought to obtain a crystal structure of P-LE complexed with PSGL-1 to elucidate 
the structural basis of recognition but anticipated that the physiological or large, 
extracellular forms of PSGL-1 would prove too heterogeneous for co- 

15 crystaUization attempts. Mutagenesis and biochemical studies indicate that P- 
selectin, in contrast to E-selectin, recognizes both polypeptide residues and a 
SLe^-modified O-glycan within the N-terminus of mature PSGL-1. Candidates for 
the PSGL-1 determinant include an anionic stretch of amino acids encompassing 
(numbered from the N-terminus of the mature polypeptide) Tyr-5, Tyr-7, and 

20 Tyr-10, one or more of which is sulfated, and a sialylated, fucosylated 
(presumably SLe^'-like) O-glycan localized to Thr-16 by indirect methods 
(Pouyani and Seed, 1995; Ramachandran et al., 1999; Sako et al., 1995; 
Wilkins et al., 1995). Sulfation of any one of the three Tyr residues is capable of 
supporting P-selectin binding, however, the relative role of individual tjrrosine 

25 sulfates have been inferred by rolling studies (Ramachandran et al., 1999) and 
not by rigorous affinity determinations. To explore these structural questions 
and to produce a homogeneously modified form of PSGL-1 suitable for 
crystallization with P-LE, we expressed a truncated form of PSGL-1 in a CHO 
cell line previously transfected with fucosyltransferase-VII and core2 N- 

30 acetylglucosamine, modification enzymes essential for the generation of 
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functional PSGL-1 in CHO cells (Kumar et al, 1996; Li et al., 1996). We utilized 
a soluble construct of PSGL-1 (19ek.Fc, (Goetz et al., 1997)) encoding the 19 N- 
terminal amino acids of PSGL-1 fused to the heavy chain domain of IgG^ (19.Fc, 
(Sako et aL, 1995)) in which we introduced an enterokinase cleavage sequence 
5 that would allow for the isolation of monomeric PSGL-1 peptides following 
expression and purification. 19ek.Fc coated onto latex microspheres was 
previously demonstrated to support rolling over CHO cells expressing P- and E- 
selectin (Goetz et al., 1997). 

19ek.Fc was digested v\dth enterokinase in order to produce 

10 monomeric PSGL-1 peptides (termed 19ek peptides). Preliminary analysis by 
MS suggested that the 19ek peptides were structurally heterogeneous (not 
shown). Therefore, resolution of the mixture was accomplished by anion- 
exchange HPLC which separated the 19ek peptide mixture into homogenous 
components dominated by a major species eluting late in the salt gradient 

15 (Figure lA). The structure of the major 19ek peptide was determined to be the 
sulfoglycopeptide (termed SGP-3) as shown in Figure lA. The PSGL-1 portion of 
SGP-3 is extensively post-translationally modified and includes sulfates on all 
three tyrosine residues and a SLe'^-containing core2 modified 0-glycan at Thr-16 
(Figure lA, inset). Importandy, the glycan is identical to one of two SLe^- 

20 containing 0-glycans characterized for PSGL-1 isolated from the HL-60 myeloid 
cell line (Wilkins et al., 1996). The minor species of 19ek peptides (Figure lA) 
are less modified versions of SGP-3. The structures of 19ek peptides SGP-1 and 
SGP-2 (Figure lA) were determined to be forms of SGP-3 containing one and 
two tyrosine sulfates, respectively. Preliminary analyses by MS indicated that 

25 every permutation of tyrosine svdfation is present within the hyposulfated 
species but these were not quantitated. Also present within the 19ek peptide 
pool and resolved by chromatography (Figure lA) are forms of SGP-3 
containing no tyrosine sulfates (glycopeptide-1, GP-1) or containing no 
carbohydrate (sulfopeptide-1, SP-1). 
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We used surface plasmon resonance to determine the binding 
affinities of P-LE for the individual 19ek peptides in order to select the most 
functional species for co-crystallization attempts. For comparison, we evaluated 
a soluble, recombinant form of PSGL-1 (sPSGL) consisting of the entire dimeric, 
extracellular domain. Consistent with earlier studies, P-LE bound immobilized 
sPSGL and 19ek peptides with rapid kinetics (Figure IB). Binding was 
determined to be specific based upon control experiments performed with EDTA 
and neutralizing Mabs (not shown) and with SGP-3 used as soluble inhibitor 
(Figure IB). The highest affinity interaction observed for the individual 19ek 
peptides was with the fully modified species SGP-3 with a of 778 nM. An 
essentially identical affinity was obtained with sPSGL (Figure IC) suggesting 
that the 19ek peptide SGP-3 functionally mimics the full-length extracellular 
PSGL-1 construct and that the N-terminal region of PSGL-1 present in the 
shorter construct constitutes the entire recognition region for P-LE. The binding 
affinities of P-LE for hyposulfated forms of SGP-3 were slighdy weaker (Figure 
IC) exhibiting s of 2.88 jiM for the disulfated species (SGP-2) and 12.3 |J,M 
for monosulfated species (SGP-1). P-LE binding affinities of versions of SGP-3 
containing no carbohydrate (SP-1) or sulfotyrosines (GP-1) were weaker yet 
and are estimated to be 113 |J,M and 31.1 \iM, respectively (Figure IC). The 
affinity of P-LE for a synthetic version of the 19ek peptide containing no 
carbohydrate or sulfation was considerably lower than any of the modified 
species and could not be determined at the concentrations of protein employed 
here. The binding affinity of P-LE binding to sPSGL and to SGP-3 determined 
here compare reasonably well to values previously determined for the binding of 
soluble P-selectin and a lectin-EGF construct of P-selectin similar to P-LE to full- 
length neutrophil PSGL-1 (K^ s of 322 nM and 422 nM, respectively) (Mehta et 
al., 1998). Additionally, the binding affinity of soluble P-selectin to a PSGL-1 
glycosulfopeptide modified similarly to SGP-3 was recently reported to have a 
Kp of -350 nM (Leppanen et al., 1999). 
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X-rav Crystal Structure of P-LE Bound to the PSGL-1 Sulfoglvcopeptide-3 . P-LE 
was co-crystallized with a two fold molar excess of SGP-3. Repeat macroseeding 
and replacement of calcium wi± strontium was required to give large 1222 
space group crystals that diffracted to 1.9 A. Phases were generated using 
5 mercury data collected in house, revealing two P-LE/SGP-3 complexes in the 
crystallographic asymmetric unit. The structure was refined using CNS to an R- 
factor of 20.4% (R-free, 23.5%) (Table 1). Of die 29 amino acids within the 
SGP-3 polypeptide, residues six throughlS, including sulfated Tyr-7 (Tys-7) and 
sulfated Tyr-10 (Tys-10) are observed. Disordered and absent from the structure 

10 are polypeptide residues one through five, containing the most N-terminal 

sulfated tyrosine (Tys-5), and residues 19-29 containing the enterokinase-linker 
region. All but one residue of the SLe^-modified O-glycan at Thr-16 is observed. 
Electiron density for the NeuNAc(2,3)-Unked to the Gal(l,3)GalNAc branch 
(Figure lA, inset) was not detected but this residue does not appear to be 

15 required for binding (Leppanen et al., 1999) . 

Two different views of the P-LE/SGP-3 complex provide a 
structural overview of the binding interaction from which aspects of the 
physiological P-selectin/PSGL-1 interaction may be inferred. SGP-3 binds with 
1:1 stoichiometiry to the P-LE lectin domain with a large epitope that 

20 excludesl641 from solvent and includes the SLe^ binding site described for 
the P-LE/SLe^ complex. Despite the fact that SGP-3 is disordered in solution as 
determined by transfer NOE studies (D. Tsao, unpublished observations) and 
likely extended, the bound conformation is compacted by internal folding that 
produces a hairpin-like structure. While these results do not exclude the 

25 possibility that different arms of the PSGL-1 homodimer, each supplying a 

different component of the binding interaction, might simultaneously engage a 
single P-selectin lectin domain, they suggest that a single arm of PSGL-1 is 
capable of providing a complete set of binding determinants. Moreover, this 
type of binding interaction allows for the possibility that individual monomers of 

30 the PSGL-1 homodimer might simultaneously engage two different P-selectin 
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molecules. The co-localization of two parallel binding interactions may be a 
physiological requirement for P-selectin function in light of the observation tiiat 
a dimerization mutant of PSGL-1 is incapable of supporting cell binding under 
the influence of shear (Snapp et al., 1998). Also immediately apparent from 
examination of the P-LE/SLe^ complex is that the binding of SGP-3 occurs on a 
face of P-LE essentially opposite the lectin domain:EGF domain interface. This 
provides direct evidence that the EGF domain of P-selectin does not participate 
in direct binding to PSGL-1, at least to die N-terminal portion of the molecule, 
but may have an indirect role in P-selectin function (see belov\^). 

Despite die fact that calcium was replaced by strontium, 
interactions seen in the complex of P-LE witii SLe^ are reproduced in the P- 
LE/SLe^ structure and are consistent with an earUer observation that this metal 
can effectively substitute for calcium (Asa et al., 1992). In the complex there is a 
large structural rearrangement of P-LE, which causes a change in the metal ion 
coordination. In the P-LE/SLe'' complex, the 2-hydroxyl of the Fuc makes a weak 
hydrogen bond witii Glu-107 (distance of 3.4 A). This is not present in the P- 
LE/SGP-3 complex but is replaced by a hydrogen bond with Glu-88 which itself 
also coordinates the bound strontium. In die structure of P-LE the side chain of 
Asn-83 is close to but does not coordinate the bound calcium and in E-LE 
binding of SLe"" binding of SLe"" causes it to interact with the metal ion and a Fuc 
hydroxyl. In the P-LE/SGP-3 complex, the position of tiiis residue has been 
replaced with die side chain of Glu-88. In this respect, the metal ligation now 
closely mimics that of MBP complexed with oligomannose (Weis et al., 1992) 
and the K3 mutant of MBP complexed with SLe"" (Ng and Weis, 1997) in which 
die corresponding residue Glu-193 is observed to ligate calcium. 

The interactions between the SGP-3 polypeptide and P-LE are a 
combination of hydrophobic and electrostatic contacts. The most N-terminal 
residue of the SGP-3 peptide in contact with the protein is Tys-7 followed by 
three residues in an extended conformation. Residues Tys-10 to Leu- 13 form a 
turn, which changes the direction of the peptide and the remaining residues run 
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to Pro-18 in an extended conformation. The sulfate of Tys-7 makes a series of 
hydrogen bonds with the side-chain hydroxyl groups of Ser-46 and Ser-47, a 
backbone amide nitrogen and via an ordered water molecule. In addition, His- 
114 of P-LE donates a hydrogen bond to the remaining sulfate oxygen 
5 completing the hydrogen bonding network. Tys-7 also makes a backbone- 
backbone hydrogen bond witii the amide nitrogen of Lys-112 and several 
hydrophobic interactions witii the side chains of Ser-47 and Lys-113. SGP-3 
residue Leu-8 packs against Leu-13 which are both packed against the 
hydrophobic surface of the protein formed by the side chains of His-108 and 
10 Lys-111. These interactions must help stabilize the compact tertiary structure of 
the peptide. The aromatic ring of the Tys-10 side chain lies against these two 
leucine residues and places the sulfate in a position where it can hydrogen bond 
with Arg-85 and the backbone amide of Thr-16. The interactions between Arg- 
85 and the SGP-3 polypeptide are made possible by a large structural 

15 rearrangement of P-LE (see below) and support the findings of an earlier site- 
directed mutagenesis study targeting tiiis residue within P-selectin (Bajora± et 
al., 1994). When mutated to Ala, die P-selectin mutant lost high affinity binding 
to PSGL-1 expressing cells yet retained low affinity (presumably exclusively 
SLe^-mediated) binding. Similarly, an antibody whose epitope has been mapped 

20 to the nearby residues 76-83 has been shown to abolish binding to PSGL-1 but 
not to SLe^ (Hirose et al., 1998). In this P-LE/SGP-3 complex, the side chain of 
Phe-12 in SGP-3 does not contact the protein but packs against itself in the 
other copy of the complex related by non-crystallographic symmetry. It seems 
likely that in solution it would lie against the hydrophobic surface formed by the 

25 side chain Leu-1 10. The only remaining interactions are seen with Pro-14 of 
SGP-3, which packs against His-108 and accepts a hydrogen bond from Arg-85. 

Compared to both unliganded P-LE and the P-LE/SLe^ complex, 
there are several dramatic changes to the conformation of P-LE associated with 
SGP-3 binding. The most significant conformational change is the movement of 

30 a loop of residues from Asn-83 to Asp-89 which brings Arg-85 in contact with 
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Tys-10 of SGP-3 and Glu-88 to a position where it now ligates the bound metal. 
This movement in turn appears to affect the position of residues Arg-54 to Glu- 
74 which move to occupy the space vacated by the Asn-83 to Asp-89 loop. In 
uncomplexed P-LE, Thr-65 packs against the side chain of Trp-1 excluding it 
5 from solvent. The movement of loops observed for the P-LE/SGP-3 complex 
would be expected to expose to Trp-1 to solvent but this residue rearranges to 
pack between Tyr-118 of the lectin domain and Glu-135 of the EGF domain. In 
conjunction with die movement of Trp-1 there is a 52 degree movement of the 
EGF domain relative to the lectin domain and the void created by the movement 
10 of tiie Trp-1 side chain is filled by an MPD molecule tiiat packs against the new 
position for the side chain. This raises the question as to whether the movement 
of the EGF domain is caused by the high concentration of MPD used for 
crystallization or due to binding of SGP-3. While the EGF domain does not 
appear to play a direct role in binding to PSGL-1, at least to the N-terminal 

15 region of the molecule, the change in its relationship to the lectin domain is an 
interesting observation in Ught of observations that the EGF domain may 
modulate ligand recognition (Gibson et al., 1995; Kansas et al., 1994). 

Alternatively, these studies may indicate secondary binding 
interactions between the EGF and CR domains and other regions of PSGL-1 C- 

20 terminal to the binding site evaluated here. Additional structural studies are 
warranted to further explore the potential interaction between the lectin and 
EGF domains. The internal folding of the SGP-3 polypeptide resulting in the 
placement of Tys-10 near the O-glycan at Thr-16 suggests that the relationship 
of the SLe^-modified glycan to tyrosine sulfates within the linear sequence of 

25 PSGL-1 may not be absolute above a minimum number of intervening residues. 
P-selectin may support multiple binding conformations of the PSGL-1 N- 
terminus, only one of which is shown in this study. The P-LE/SGP-3 structure 
suggests tiiat Tys-7 and Tys-10 (and not Tys-5) of PSGL-1 are essential for 
binding, a result consistent with a mutagenesis study indicating a preferential 

30 role for these specific tyrosine sulfate residues in P-selectin mediated roUing. 
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However, flexibility of the PSGL-1 N-terminus might allow different 
permutations of tyrosine sulfates, e.g., Tys-5 and Tys-7 or Tys-5 and Tys-10, to 
bind in a different register to the basic residues within P-selectin. This 
hypothesis may explain why the affinity of P-LE for the trisulfated SGP-3 is 
5 greater than for disulfated SGP-2. While it is possible that yet a third undefined 
tyrosine sulfate binding site exists within P-selectin which accounts for this 
observation, flexibility of the PSGL-1 polypeptide may permit any of the three 
tyrosine sulfate residues to bind to the two sites defined here. This may explain 
why mutations of PSGL-1 containing single tyrosine sulfate residues (including 

10 Tys-5) support P-selectin binding (Ramachandran et al., 1999). Also supporting 
this potential flexibility in binding is the observation that the corresponding N- 
terminal region of murine PSGL-1 is also critical for binding to P-selectin and yet 
contains two potential tyrosine sulfates only two and four residues removed 
firom a potential glycosylation site corresponding to Thr-16. Murine P-selectin 

15 also contains basic residues at positions 85 and 114, and based upon the 

observations here for the human P-LE/SGP-3 interaction, might be anticipated 
to make similar ionic interactions with the tyrosine sulfates within murine PSGL- 
1. A consequence of this interaction would be fewer intervening residues 
spanning the tyrosine sulfate and SLe^ binding sites. Similarly, flexibility of the 

20 N-terminal region of PSGL-1 might also permit recognition of other O-glycan 
structures attached to Thr-16. SLe^ and Le"" epitopes within a longer poly-N- 
acetyllactosamine O-glycan have been described for myeloid PSGL-1 and P- 
selectin may accommodate this in similar fashion. 

25 Concluding Remarks. The crystal structures of P-LE and E-LE complexed with 
SLe^ and of P-LE complexed with the PSGL-1 sulfoglycopeptide shown here 
contribute significantly to our understanding of the molecular basis of selectin 
recognition. The structures of P-LE and E-LE complexed with SLe^ exhibit both 
common and dissimilar binding interactions, which explain their differential 

30 affinity for this shared ligand. The structure of the P-LE/SGP-3 complex in 
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particular illustrates how the differential recognition of PSGL-1 by E- and P- 
selectin is achieved. While the primary sequences of E- and P-selectin lectin 
domains are more than 50% identical, residues critical for the P-LE/SGP-3 
interaction are unique to P-selectin and presumably account for the higher 
5 affinity of the P-selectin/PSGL-1 interaction. Residues Arg-85 and His-114, 
important for ionic interactions with tyrosine sulfate residues within SGP-3, are 
unique to P-selectin (the corresponding residues in human E-selectin are 
uncharged Gin and Leu, respectively). Interestingly, this trend in differential 
charge at these positions is observed for P-selectin and E-selectin in the mouse, 

10 rat^ rabbit and cow suggesting a conserved binding motif for P-selectin 

interactions with PSGL-1. The structural observations made here offer potential 
insights into the nature of the L-selectin/PSGL-1 interaction as well. L-selectin 
also appears to recognize the N-terminal region of PSGL-1 based upon Mab 
neutralization (Kansas, 1996) and mutagenesis (Ramachandran et al., 1999) 

15 studies. While the affinity of this interaction has not yet been determined, it is 
anticipated that it will be of intermediate affinity relative to the P-selectin and E- 
selectin interactions with PSGL-1. This is predicated upon the assumption that 
the highly homologous L-selectin lectin domain adopts a conformation similar to 
that determined for E- and P-selectin and that recognition of the PSGL-1 N- 

20 terminus shares properties with those of P-selectin. As with P-selectin, human L- 
selectin contains a basic residue at position 85 (a Lys residue) which might be 
anticipated to make an essential ionic interaction with a sulfated Tyr within the 
PSGL-1 N-terminus. However, L-selectin does not contain a basic residue at 
position 114, which could support a second ionic interaction with a tyrosine 

25 sulfate within PSGL-1 and hence binding affinity perhaps comparable to P- 
selectin. 

The interactions of the E- and P-selectins with SLe^ and the PSGL- 
1 sulfoglycopeptide shown here, based largely upon a network of hydrogen 
bonds and selected ionic interactions, are fundamental to the binding affinities 
30 and fast binding kinetics described for this class of cell adhesion molecules. The 
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rapid reversibility of relatively low affinity selectin/counterreceptor interactions 
produces attachment and rolling of leukocytes essential for their subsequent 
activation, firm adhesion and extravasation into tissues. In the context of 
inflammation, PSGL-1 appears to play a central role in selectin-mediated 
5 processes, mediating the initial attachment of neutrophils to the vascular 

endothelium as well as promoting neutrophil-neutrophil and neutrophil-platelet 
interactions likely important for amplification of the inflammatory response. The 
rational design of small molecule antagonists for the treatment of inflammatory 
conditions in which E- and P-selectin/SLe^ and P-selectin/PSGL-1 interactions 
10 are demonstrated will be assisted considerably by the structural information 
presented herein. 
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Table 2 



E-selectin SLEX 4A contacts 

5 TYR48, GLU80, ASN82, ASN83, GLU92, TYR94, ARG97, GLU98, ASN105, 
ASP106, GLU107, Bound Calcium 

E-selectin SLEX 8A contacts 

10 TYR44, SER45, PR046, SER47, TYR48, ALA77, PR078, GLY79, GLU80, PR081, 
ASN82, ASN83, GLU88, CYS90, GLU92, TYR94, ARG97, GLU98, LYS99, 
ASPlOO, TRP104, ASN105, ASP106, GLU107, ARG108, LYSlll, LYS113, Bound 
Calcium 

15 P-selectin SLEX 4A contacts 

TYR48, GLU80, ASN82, GLU92, TYR94, PR098, SER99, ASN105, ASP106, 
GLU107, Bound calcium 

20 P-selectin SLEX 8A contacts 

TYR44, SER46, SER47, TYR48, ALA77, ASP78, ASN79, GLU80, PR081, ASN82, 
ASN83, ARG85, GLU88, CYS90, GLU92, ILE93, TYR94, LYS96, SER97, PR098, 
SER99, ALAIOO, TRP104, ASN105, ASP106, GLU107, HIS108, LYSlll, 
25 LYS 113, Bound Calcium 

P-selectin PSGL-1 4A contacts 

ALA9, TYR45, SER46, SER47, TYR48, GLU80, ASN82, LYS84, ARG85, GLU88, 
30 GLU92, TYR94, PR098, SER99, ASN105, ASP106, GLU107, HIS108, LEUllO, 
LYSlll, LYS112, LYS113, HIS114, Bound Strontium 

P-selectin PSGL-1 8A contacts 

35 SER6, THR7, LYS8, ALA9, TYRIO, SERll, TYR44, TYR45, SER46, SER47, 

TYR48, TYR49, TRP50, ALA77, ASP78, ASN79, GLU80, PR081, ASN82, ASN83, 
LYS84, ARG85, ASN86, ASN87, GLU88, CYS90, GLU92, ILE93, TYR94, ILE95, 
LYS96, SER97, PR098, SER99, ALAIOO, TRP104, ASN105, ASP106, GLU107, 
HIS108, CYS109, LEUllO, LYSlll, LYS112, LYS113, HIS114, Bound Strontium 
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